Fundamental Limits of Distributed Linearly Separable Computation under Cyclic Assignment

Wenbo Huang, Kai Wan, Hua Sun, Mingyue Ji, Robert Caiming Qiu, Giuseppe Caire

2024-02-19

Abstract: This paper studies the master-worker distributed linearly separable computation problem, where the computation task, referred to as a linearly separable function, is a typical linear transform model widely used in cooperative distributed gradient coding, real-time rendering, linear transformers, etc. A master asks $\mathsf{N}$ distributed workers to compute a linearly separable function from $\mathsf{K}$ datasets. The computation task on $\mathsf{K}$ datasets can be expressed as $\mathsf{K}_{\rm c}$ linear combinations of $\mathsf{K}$ messages, where each message is the output of an individual function on one dataset. The straggler effect is also considered: the master should be able to accomplish the task from the answers of any $\mathsf{N}_{\rm r}$ of the $\mathsf{N}$ distributed workers. The computation cost is defined as the number of datasets assigned to each worker, while the communication cost is defined as the number of (coded) messages that should be received. The objective is to characterize the optimal tradeoff between the computation and communication costs. The problem has so far remained open, even under the cyclic data assignment. Since various distributed computing schemes in the literature were proposed under the cyclic data assignment, this paper closes the problem for the cyclic assignment. It proposes a new computing scheme with the cyclic assignment based on the concept of interference alignment, treating each message that a worker cannot compute as interference from that worker. Under the cyclic assignment, the proposed computing scheme is then proved to be optimal when $\mathsf{N}=\mathsf{K}$ and order optimal within a factor of $2$ otherwise.
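
As a rough illustration of the task described in the abstract, the linear combinations can be written in matrix form. The symbols $D_k$, $f_k$, $W_k$ and $\mathbf{F}$ are notation assumed here for the sketch and do not appear in the text above. Let $W_k = f_k(D_k)$ be the message obtained from the $k$-th of the $\mathsf{K}$ datasets. The task is then to recover

$$
\mathbf{F}
\begin{bmatrix}
W_1 \\ \vdots \\ W_{\mathsf{K}}
\end{bmatrix},
\qquad
\mathbf{F} \in \mathbb{F}^{\mathsf{K}_{\rm c} \times \mathsf{K}},
$$

where each of the $\mathsf{K}_{\rm c}$ rows of the coefficient matrix $\mathbf{F}$ defines one linear combination of the $\mathsf{K}$ messages, and $\mathbb{F}$ denotes the (assumed) field over which the messages and coefficients are defined.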

Information Theory

What problem does this paper attempt to address?

The paper addresses how to optimize the tradeoff between computation cost and communication cost in distributed linearly separable computation in the presence of the straggler effect (i.e., some worker nodes are slow or fail to respond). Specifically, it studies the master-worker distributed linearly separable computation problem, where the task can be represented as $K_{\rm c}$ linear combinations of $K$ messages, and each message is the output of a function on a single dataset. The master must be able to complete the task from the answers of any $N_{\rm r}$ of the $N$ workers. The goal is to characterize the optimal tradeoff between the number of datasets assigned to each worker (computation cost) and the number of coded messages the master must receive (communication cost). The main contribution is a new computing scheme, based on interference alignment, that achieves an optimal or near-optimal computation-communication tradeoff under the cyclic assignment. In particular, when $N = K$ the proposed scheme is proven to be optimal; in all other cases it is order optimal within a factor of 2. In addition, experimental results show that the scheme significantly outperforms existing methods in practice. For example, tests on Tencent Cloud show that the time of the entire distributed computing process is reduced by up to 72.8%.
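
The cyclic assignment and the straggler requirement can be made concrete with a small sketch. The code below is only an illustration for the case $N = K$, not the paper's coding scheme: the indexing convention (worker `n` holds `per_worker` consecutive datasets starting at index `n`) and the function names are assumptions made here. It checks by brute force a necessary condition for completing the task, namely that any $N_{\rm r}$ responding workers jointly hold every dataset.

```python
from itertools import combinations


def cyclic_assignment(num_workers, per_worker):
    """Assumed convention for N = K: worker n holds datasets n, n+1, ..., wrapping around."""
    return [
        {(n + j) % num_workers for j in range(per_worker)}
        for n in range(num_workers)
    ]


def covers_all_datasets(assignment, num_responding):
    """True if every group of `num_responding` workers jointly holds all datasets."""
    everything = set(range(len(assignment)))
    return all(
        set().union(*(assignment[n] for n in group)) == everything
        for group in combinations(range(len(assignment)), num_responding)
    )


N = 6            # workers, equal to the number of datasets in this sketch
N_r = 4          # the master waits for any 4 workers
M = N - N_r + 1  # each dataset is then held by M = 3 workers

assignment = cyclic_assignment(N, M)
print(assignment[0], assignment[5])                         # {0, 1, 2} and {0, 1, 5}
print(covers_all_datasets(assignment, N_r))                 # True
print(covers_all_datasets(cyclic_assignment(N, M - 1), N_r))  # False
```

With $N = K$, each dataset is held by exactly `M` workers, so up to $N - N_{\rm r}$ stragglers can be tolerated only if `M` is at least $N - N_{\rm r} + 1$; the last line shows coverage failing with one dataset fewer per worker. Coverage alone says nothing about the communication cost, which is what the paper's scheme optimizes.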


On Decentralized Linearly Separable Computation With the Minimum Computation Cost

Coded Computation across Shared Heterogeneous Workers with Communication Delay

On the Capacity of Privacy-Preserving and Straggler-Robust Distributed Coded Computing

Heterogeneous Coded Computation Across Heterogeneous Workers

Perfect Multi-User Distributed Computing

Improved Computation-Communication Trade-Off for Coded Distributed Computing using Linear Dependence of Intermediate Values

Coded Distributed Computing with Wireless Shared Heterogeneous Workers

Linear Network Coding for Robust Function Computation and Its Applications in Distributed Computing

Relay Selection and Load Allocation for LT Coded Distributed Computing in Two-Hop Heterogeneous Computation Network

Design and Performance Analysis of Partial Computation Output Schemes for Accelerating Coded Machine Learning

Lagrange Coded Computing: Optimal Design For Resiliency, Security, And Privacy

Communication-Efficient Task Scheduling for Real-Time Distributed Computing

A Scalable Framework for Wireless Distributed Computing

Distributed Computing with Heterogeneous Servers

New Results on the Computation-Communication Tradeoff for Heterogeneous Coded Distributed Computing

How to Optimally Allocate Resources for Coded Distributed Computing?

Coded Distributed Computing with Heterogeneous Function Assignments

Block Allocation of Systematic Coded Distributed Computing in Heterogeneous Straggling Networks

Heterogeneous Coded Distributed Computing: Joint Design of File Allocation and Function Assignment

Incentive Mechanism Design for Distributed Coded Machine Learning