Abstract:This paper studies MapReduce-based heterogeneous coded distributed computing (CDC) where, besides different computing capabilities at workers, input files to be accessed by computing jobs have nonuniform popularity. We propose a file placement strategy that can handle an arbitrary number of input files. Furthermore, we design a nested coded shuffling strategy that can efficiently manage the nonuniformity of file popularity to maximize the coded multicasting opportunity. We then formulate the joint optimization of the proposed file placement and nested shuffling design variables to optimize the proposed CDC scheme. To reduce the high computational complexity in solving the resulting mixed-integer linear programming (MILP) problem, we propose a simple two-file-group-based file placement approach to obtain an approximate solution. Numerical results show that the optimized CDC scheme outperforms other alternatives. Also, the proposed two-file-group-based approach achieves nearly the same performance as the conventional branch-and-cut method in solving the MILP problem but with substantially lower computational complexity that is scalable over the number of files and workers. For computing jobs with aggregate target functions that commonly appear in machine learning applications, we propose a heterogeneous compressed CDC (C-CDC) scheme to further improve the shuffling efficiency. The C-CDC scheme uses a local data aggregation technique to compress the data to be shuffled for the shuffling load reduction. We again optimize the proposed C-CDC scheme and explore the two-file-group-based low-complexity approach for an approximate solution. Numerical results show the proposed C-CDC scheme provides a considerable shuffling load reduction over the CDC scheme, and also, the two-file-group-based file placement approach maintains good performance.

Multi-access Distributed Computing Models from Map-Reduce Arrays

Multi-Access Distributed Computing

Massively Parallel Computation via Remote Memory Access

Coded Distributed Computing with Heterogeneous Function Assignments

Wireless MapReduce Arrays for Coded Distributed Computing

Coded Caching Schemes for Multiaccess Topologies via Combinatorial Design

Delay-Optimal Computation Offloading in Large-Scale Multi-Access Edge Computing Using Mean Field Game

MDCRA: A Reconfigurable Accelerator Framework for Multiple Dataflow Lanes

Cascaded Coded Distributed Computing Schemes Based on Placement Delivery Arrays

Design and Optimization of Heterogeneous Coded Distributed Computing with Nonuniform File Popularity

Low Complexity Distributed Computing via Binary Matrices with Extension to Stragglers

Multi-Antenna Coded Caching for Multi-Access Networks with Cyclic Wrap-Around

Combinatorial Multi-Access Coded Caching: Improved Rate-Memory Trade-off with Coded Placement

How to Optimally Allocate Resources for Coded Distributed Computing?

Decentralized Adaptive Resource-Aware Computation Offloading & Caching for Multi-Access Edge Computing Networks.

An Application of Storage-Optimal MatDot Codes for Coded Matrix Multiplication: Fast k-Nearest Neighbors Estimation

Improved DDPG Based Two-Timescale Multi-Dimensional Resource Allocation for Multi-Access Edge Computing Networks

Direct Distributed Memory Access for CMPs

Joint Design of Shuffling and Function Assignment in Heterogeneous Coded Distributed Computing

MRCN: Enhanced Coherence Mechanism for Near Memory Processing Architectures

Distributed Iterative CT Reconstruction Using Multi-Agent Consensus Equilibrium