Abstract:We consider the problem of encoding information in a system of N=K+R processors that operate in a decentralized manner, i.e., without a central processor which orchestrates the operation. The system involves K source processors, each holding some data modeled as a vector over a finite field. The remaining R processors are sinks, and each of which requires a linear combination of all data vectors. These linear combinations are distinct from one sink processor to another, and are specified by a generator matrix of a systematic linear error correcting code. To capture the communication cost of decentralized encoding, we adopt a linear network model in which the process proceeds in consecutive communication rounds. In every round, every processor sends and receives one message through each one of its p ports. Moreover, inspired by linear network coding literature, we allow processors to transfer linear combinations of their own data and previously received data. We propose a framework that addresses the decentralized encoding problem on two levels. On the universal level, we provide a solution to the decentralized encoding problem for any possible linear code. On the specific level, we further optimize our solution towards systematic Reed-Solomon codes, as well as their variant, Lagrange codes, for their prevalent use in coded storage and computation systems. Our solutions are based on a newly-defined collective communication operation we call all-to-all encode.

What problem does this paper attempt to address?

The paper attempts to address the problem of how to efficiently encode information in decentralized systems. Specifically, the paper considers a system composed of \(N = K + R\) processors, which operate in a decentralized manner, meaning there is no central processor to coordinate operations. In the system, there are \(K\) source processors, each holding some data (represented as vectors over a finite field). The remaining \(R\) processors are receivers, each requiring a linear combination of all data vectors. These linear combinations are specified by a generator matrix of a systematic linear error-correcting code. To study the communication cost of decentralized encoding, the paper adopts a linear network model where the processing occurs in successive communication rounds. In each round, each processor sends and receives one message through each of its \(p\) ports. Additionally, inspired by the literature on linear network coding, the paper allows processors to transmit linear combinations of their own data and previously received data. The paper proposes a framework to address the decentralized encoding problem on two levels: 1. **General Level**: Provides a solution to the decentralized encoding problem for arbitrary linear codes. 2. **Specific Level**: Further optimizes the solution for systematic Reed-Solomon codes and their variant Lagrange codes, as these codes are widely used in coded storage and computing systems. The main contributions of the paper include: - Proposing a new collective communication operation called "all-to-all encode," which handles the case where each processor is both a source and a receiver. - At the general level, developing a communication-efficient general algorithm that can achieve all-to-all encoding for any square encoding matrix, which is optimal in terms of communication rounds \(C_1\) and near-optimal in terms of the number of communication elements \(C_2\). - At the specific level, providing a series of all-to-all encoding algorithms for Vandermonde, Cauchy-like, and Lagrange matrices, optimizing the performance of the general algorithm in terms of \(C_2\). - Combining the above results, providing a decentralized encoding solution for systematic Reed-Solomon codes, significantly reducing communication costs and extending to Lagrange codes. In summary, the paper aims to reduce communication costs and improve the performance of decentralized systems by proposing efficient decentralized encoding methods.

On the Encoding Process in Decentralized Systems

Scalable Network Coding over Embedded Fields

On Decentralized Linearly Separable Computation With the Minimum Computation Cost

From Centralized to Decentralized Coded Caching

Coded Computing for Resilient Distributed Computing: A Learning-Theoretic Framework

Linear Network Code for Erasure Broadcast Channel with Feedback: Complexity and Algorithms

A Survey of Computation-Driven Data Encoding

Block Circulant Codes with Application to Decentralized Systems

Linear Network Coding for Robust Function Computation and Its Applications in Distributed Computing

Scalable Network Coding for Heterogeneous Devices over Embedded Fields

Communication Cost for Updating Linear Functions when Message Updates are Sparse: Connections to Maximally Recoverable Codes

Coding for Network Coding

Coded Downlink Massive Random Access and a Finite de Finetti Theorem

Network Communication with operators in Dedekind Finite and Stably Finite Rings

Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-Vector Multiplication

Joint Network and Dirty-Paper Coding for Multi-Way Relay Networks with Pairwise Information Exchange.

Distributed Matrix Computations with Low-weight Encodings

Strategic Communication with Cost-Dependent Decoders via the Gray-Wyner Network

A Fully Programmable Reed-Solomon Decoder on a Multi-Core Processor Platform.

Erasure Coding for Distributed Storage: An Overview

Variable-Rate Linear Network Error Correction MDS Codes