TinyFEL: Communication, Computation, and Memory Efficient Tiny Federated Edge Learning Via Model Sparse Update

Qimei Chen,Han Cheng,Yipeng Liang,Guangxu Zhu,Ming Li,Hao Jiang
DOI: https://doi.org/10.1109/jiot.2024.3499375
IF: 10.6
2024-01-01
IEEE Internet of Things Journal
Abstract:Federated Edge Learning (FEL) is regarded as a promising distributed machine learning paradigm to reduce transmission latency and resources as well as preserve raw data privacy by collaboratively training local deep learning models across multiple edge devices. However, with the development of Artificial Intelligence (AI) technologies, the size of neural network models grows exponentially with their parameters to meet variable application requirements, which poses significant challenges to the computation, communication, and memory abilities of edge devices. Existing designs typically focus on either communication or computation efficiency without caring each device’s memory ability. To deal with the above issues, we first introduce a novel model sparse update enabled Tiny Federated Edge Learning (TinyFEL) architecture, which terminates the back-propagation early in local model training processes. Therefore, the proposed TinyFEL can reduce local memory occupation and lessen the communication-and-computation burden. Furthermore, we propose a parameter splitting mechanism instead of transmitting the full model, only a part of updated layers of parameters is transmitted for aggregation, which significantly reduced the communication overheads. Thereafter, we develop a communication and computation latency minimization problem to accelerate the training of TinyFEL. To this end, we theoretically analyze the convergence performance of TinyFEL, which unveils the mathematical relationship among sparse update ratio assignment, device selection, and learning performance. Then, a joint sparse update ratio assignment, device selection, and resource allocation strategy is introduced based on the Alternating Direction Method of Multipliers (ADMM) and Block Coordinate Descent (BCD) algorithms. Numerical results indicate that our proposed TinyFEL can reduce training memory occupation by over 40% than the traditional FEL at the cost of negligible accuracy loss.
What problem does this paper attempt to address?