Abstract: Real-time machine learning (ML) has recently attracted significant interest due to its potential to support instantaneous learning, adaptation, and decision making in a wide range of application domains, including self-driving vehicles, intelligent transportation, and industrial automation. In this paper, we investigate real-time ML in a federated edge intelligence (FEI) system, an edge computing system that implements federated learning (FL) solutions based on data samples collected and uploaded from decentralized data networks, e.g., Internet-of-Things (IoT) and/or wireless sensor networks. FEI systems often exhibit heterogeneous distributions of communication and computational resources, as well as non-i.i.d. data samples arriving at different edge servers, resulting in long model training time and inefficient resource utilization. Motivated by this fact, we propose a time-sensitive federated learning (TS-FL) framework to minimize the overall run-time for collaboratively training a shared ML model with desirable accuracy. We investigate training acceleration solutions for TS-FL with both synchronous coordination (TS-FL-SC) and asynchronous coordination (TS-FL-ASC). To address the straggler effect in TS-FL-SC, we develop an analytical solution to characterize the impact of selecting different subsets of edge servers on the overall model training time. We propose a server dropping-based solution that removes some slow edge servers from model training if their impact on the resulting model accuracy is limited. We also propose a joint optimization algorithm that minimizes the overall time consumption of model training by selecting the participating edge servers, the local epoch number (the number of model training iterations per coordination), and the data batch size (the number of data samples for each model training iteration).
Motivated by the fact that data samples at the slowest edge server may exhibit special characteristics that cannot be removed from model training, we develop an analytical expression to characterize the impact of both the staleness effect of asynchronous coordination and the straggler effect of FL on the time consumption of TS-FL-ASC. We propose a load forwarding-based solution that allows a slow edge server to offload part of its training samples to trusted edge servers with higher processing capability. We develop a hardware prototype to evaluate the model training time of a heterogeneous FEI system. Experimental results show that our proposed TS-FL-SC and TS-FL-ASC reduce the overall model training time by up to 63% and 28%, respectively, compared with traditional FL solutions.
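
The straggler effect and the two acceleration ideas named in the abstract can be illustrated with a toy timing model. The sketch below is not the paper's algorithm: the function names, the "drop the k slowest servers" rule, and the assumption that a server's time per round equals its sample count divided by a fixed processing rate are all hypothetical simplifications. It ignores communication time, staleness, and the effect of dropping or moving data on model accuracy, all of which the paper's analysis accounts for.

```python
from typing import Dict, List


def sync_round_time(times: Dict[str, float], selected: List[str]) -> float:
    """Round time under synchronous coordination: the coordinator waits
    for the slowest selected server (the straggler)."""
    return max(times[s] for s in selected)


def drop_slowest(times: Dict[str, float], num_drop: int) -> List[str]:
    """Hypothetical dropping rule: keep every server except the
    num_drop slowest ones."""
    ranked = sorted(times, key=times.get)  # fastest first
    return ranked[: len(ranked) - num_drop]


def forward_samples(samples: Dict[str, int], slow: str, fast: str,
                    count: int) -> Dict[str, int]:
    """Hypothetical load forwarding: move `count` samples from a slow
    server to a faster, trusted one. Returns a new allocation."""
    moved = dict(samples)
    moved[slow] -= count
    moved[fast] += count
    return moved


def compute_times(samples: Dict[str, int],
                  rates: Dict[str, float]) -> Dict[str, float]:
    """Assumed cost model: time per round = samples / processing rate."""
    return {s: samples[s] / rates[s] for s in samples}


# Illustrative numbers only (not taken from the paper).
samples = {"A": 100, "B": 100, "C": 100}      # samples per server
rates = {"A": 100.0, "B": 50.0, "C": 25.0}    # samples per second

times = compute_times(samples, rates)
baseline = sync_round_time(times, list(times))           # C is the straggler

kept = drop_slowest(times, num_drop=1)                   # drop C
after_drop = sync_round_time(times, kept)

balanced = compute_times(forward_samples(samples, "C", "A", 50), rates)
after_forward = sync_round_time(balanced, list(balanced))  # C kept, load shared

print(baseline, after_drop, after_forward)
```

In this toy setting both options shorten the round, but dropping discards server C's data entirely, whereas forwarding keeps all samples in training, which is the motivation the abstract gives for the load forwarding-based solution.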

TinyFEL: Communication, Computation, and Memory Efficient Tiny Federated Edge Learning Via Model Sparse Update

TinyFL: A Lightweight Federated Learning Method with Efficient Memory-and-Communication

Importance-Aware Data Selection and Resource Allocation in Federated Edge Learning System

Enhanced Hybrid Hierarchical Federated Edge Learning Over Heterogeneous Networks

LightFed: An Efficient and Secure Federated Edge Learning System on Model Splitting

Theoretical Analysis and Performance Evaluation for Federated Edge Learning with Integrated Sensing, Communication and Computation

Towards Communication-Efficient and Attack-Resistant Federated Edge Learning for Industrial Internet of Things

An Efficient Asynchronous Federated Learning Protocol for Edge Devices

Online-Learning-Based Fast-Convergent and Energy-Efficient Device Selection in Federated Edge Learning

Semi-Decentralized Federated Edge Learning with Data and Device Heterogeneity

Accelerating Federated Learning with Data and Model Parallelism in Edge Computing

Semi-Decentralized Federated Edge Learning for Fast Convergence on Non-IID Data

EdgeFed: Optimized Federated Learning Based on Edge Computing

Time-Correlated Sparsification for Efficient Over-the-Air Model Aggregation in Wireless Federated Learning

Solving the Federated Edge Learning Participation Dilemma: A Truthful and Correlated Perspective

Lead federated neuromorphic learning for wireless edge artificial intelligence

Joint Resource Optimization for Federated Edge Learning with Integrated Sensing, Communication and Computation

Efficient federated learning on resource-constrained edge devices based on model pruning

Time-sensitive Learning for Heterogeneous Federated Edge Intelligence

Toward Communication-Efficient Federated Learning in the Internet of Things with Edge Computing

Communication-efficient Federated Edge Learning via Optimal Probabilistic Device Scheduling