Abstract: Originating from distributed learning, federated learning enables privacy-preserving collaboration at a new level of abstraction by sharing only the model parameters. While current research mainly focuses on optimizing learning algorithms and minimizing the communication overhead inherited from distributed learning, there is still a considerable gap when it comes to real implementation on mobile devices. In this article, we start with an empirical experiment to demonstrate that computation heterogeneity is a more pronounced bottleneck than communication on the current generation of battery-powered mobile devices, and that existing methods are hampered by mobile stragglers. Further, non-identically distributed data across mobile users makes the selection of participants critical to accuracy and convergence. To tackle the computational and statistical heterogeneity, we use data as a tuning knob and propose two efficient polynomial-time algorithms to schedule different workloads on various mobile devices, one for identically distributed data and one for non-identically distributed data. For identically distributed data, we combine partitioning and linear bottleneck assignment to achieve near-optimal training time without accuracy loss. For non-identically distributed data, we convert the scheduling problem into an average cost minimization problem and propose a greedy algorithm to find a reasonable balance between computation time and accuracy. We also establish an offline profiler to quantify the runtime behavior of different devices, which serves as the input to the scheduling algorithms. We conduct extensive experiments on a mobile testbed with two datasets and up to 20 devices. Compared with common benchmarks, the proposed algorithms achieve a 2–100× per-epoch speedup and a 2–7 percent accuracy gain, and boost the convergence rate by more than 100 percent on CIFAR10.
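The linear bottleneck assignment step mentioned in the abstract can be illustrated with a minimal sketch. This is a generic textbook formulation, not necessarily the authors' algorithm: it assumes a hypothetical square matrix cost[p][d] giving the profiled training time of data partition p on device d, and it finds the one-to-one assignment that minimizes the slowest device's time by binary-searching a time threshold and checking feasibility with bipartite matching. The function name and the example numbers are illustrative assumptions.

```python
from typing import List, Optional, Tuple


def bottleneck_assignment(cost: List[List[float]]) -> Tuple[float, List[int]]:
    """Return (bottleneck, assignment) where assignment[p] is the device for
    partition p and the largest assigned cost is as small as possible."""
    n = len(cost)
    # Candidate thresholds: the optimum is always one of the matrix entries.
    thresholds = sorted({c for row in cost for c in row})

    def match_under(limit: float) -> Optional[List[int]]:
        # Augmenting-path bipartite matching using only edges with cost <= limit.
        owner = [-1] * n  # owner[d] = partition currently placed on device d

        def try_assign(p: int, seen: List[bool]) -> bool:
            for d in range(n):
                if cost[p][d] <= limit and not seen[d]:
                    seen[d] = True
                    if owner[d] == -1 or try_assign(owner[d], seen):
                        owner[d] = p
                        return True
            return False

        for p in range(n):
            if not try_assign(p, [False] * n):
                return None  # no perfect matching under this threshold
        assignment = [0] * n
        for d, p in enumerate(owner):
            assignment[p] = d
        return assignment

    lo, hi = 0, len(thresholds) - 1
    best = match_under(thresholds[hi])  # the largest entry is always feasible
    while lo < hi:
        mid = (lo + hi) // 2
        result = match_under(thresholds[mid])
        if result is not None:
            hi, best = mid, result
        else:
            lo = mid + 1
    return thresholds[lo], best


# Hypothetical profiled times (seconds): rows are partitions, columns are devices.
times = [[4, 9, 3],
         [8, 2, 7],
         [6, 5, 1]]
bottleneck, plan = bottleneck_assignment(times)
```

Minimizing the maximum rather than the sum matches the synchronous setting described in the abstract, where an epoch finishes only when the slowest device (the straggler) does.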

FLASH: Heterogeneity-Aware Federated Learning at Scale

Towards Efficient Scheduling of Federated Mobile Devices under Computational and Statistical Heterogeneity

Hierarchical Federated Learning: the Interplay of User Mobility and Data Heterogeneity

Characterizing Impacts of Heterogeneity in Federated Learning Upon Large-Scale Smartphone Data

FLASH: Federated Learning Across Simultaneous Heterogeneities

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Heterogeneous Federated Learning: State-of-the-art and Research Challenges

Asynchronous Federated Learning on Heterogeneous Devices: A Survey

A Survey on Heterogeneous Federated Learning

ParallelSFL: A Novel Split Federated Learning Framework Tackling Heterogeneity Issues

HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients

Advances in Robust Federated Learning: Heterogeneity Considerations

HeteroSwitch: Characterizing and Taming System-Induced Data Heterogeneity in Federated Learning

Equalized Aggregation for Heterogeneous Federated Mobile Edge Learning

FedShift: Tackling Dual Heterogeneity Problem of Federated Learning via Weight Shift Aggregation

Completely Heterogeneous Federated Learning

Analysis and Optimization of Wireless Federated Learning with Data Heterogeneity

Federated learning with workload-aware client scheduling in heterogeneous systems

FlocOff: Data Heterogeneity Resilient Federated Learning with Communication-Efficient Edge Offloading

Heterogeneity-Aware Resource Allocation and Topology Design for Hierarchical Federated Edge Learning

Data-Heterogeneous Hierarchical Federated Learning with Mobility