Efficient asynchronous federated learning with sparsification and quantization

Juncheng Jia,Ji Liu,Chendi Zhou,Hao Tian,Mianxiong Dong,Dejing Dou
DOI: https://doi.org/10.1002/cpe.8002
2024-01-26
Concurrency and Computation Practice and Experience
Abstract:Summary While data is distributed in multiple edge devices, federated learning (FL) is attracting more and more attention to collaboratively train a machine learning model without transferring raw data. FL generally exploits a parameter server and a large number of edge devices during the whole process of the model training, while several devices are selected in each round. However, straggler devices may slow down the training process or even make the system crash during training. Meanwhile, other idle edge devices remain unused. As the bandwidth between the devices and the server is relatively low, the communication of intermediate data becomes a bottleneck. In this article, we propose time‐efficient asynchronous federated learning with sparsification and quantization, that is, TEASQ‐Fed. TEASQ‐Fed can fully exploit edge devices to asynchronously participate in the training process by actively applying for tasks. We utilize control parameters to choose an appropriate number of parallel edge devices, which simultaneously execute the training tasks. In addition, we introduce a caching mechanism and weighted averaging with respect to model staleness to further improve the accuracy. Furthermore, we propose a sparsification and quantitation approach to compress the intermediate data to accelerate the training. The experimental results reveal that TEASQ‐Fed improves the accuracy (up to 16.67% higher) while accelerating the convergence of model training (up to twice faster).
computer science, theory & methods, software engineering
What problem does this paper attempt to address?