Boosting Federated Learning in Resource-Constrained Networks

Mohamed Yassine Boukhari,Akash Dhasade,Anne-Marie Kermarrec,Rafael Pires,Othmane Safsafi,Rishi Sharma
2023-12-11
Abstract:Federated learning (FL) enables a set of client devices to collaboratively train a model without sharing raw data. This process, though, operates under the constrained computation and communication resources of edge devices. These constraints combined with systems heterogeneity force some participating clients to perform fewer local updates than expected by the server, thus slowing down convergence. Exhaustive tuning of hyperparameters in FL, furthermore, can be resource-intensive, without which the convergence is adversely affected. In this work, we propose GeL, the guess and learn algorithm. GeL enables constrained edge devices to perform additional learning through guessed updates on top of gradient-based steps. These guesses are gradientless, i.e., participating clients leverage them for free. Our generic guessing algorithm (i) can be flexibly combined with several state-of-the-art algorithms including FedProx, FedNova or FedYogi; and (ii) achieves significantly improved performance when the learning rates are not best tuned. We conduct extensive experiments and show that GeL can boost empirical convergence by up to 40% in resource-constrained networks while relieving the need for exhaustive learning rate tuning.
Machine Learning,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?
The paper primarily aims to address the issues encountered in Federated Learning (FL) within resource-constrained networks, specifically focusing on the following aspects: 1. **Training efficiency under resource constraints**: Due to the limited computational and communication capabilities of edge devices (clients), some participating clients are unable to complete the expected number of local updates, thereby affecting the overall training convergence speed. 2. **Challenges brought by system heterogeneity**: There are differences in computational capabilities among clients, and this heterogeneity leads to varying numbers of local updates that different clients can perform, further exacerbating the imbalance and slow convergence issues during the training process. 3. **Difficulties in hyperparameter tuning**: Especially in distributed data environments, adjusting hyperparameters such as learning rates is both expensive and time-consuming. Even with some adaptive optimizers to broaden the range of well-performing parameter values, grid search is still required for different client and server learning rates. To address the above issues, the paper proposes a new algorithm called "Guess and Learn" (GeL). The core idea of GeL is to utilize the momentum information accumulated by clients to perform additional "guess" updates. These updates do not require extra gradient computation costs and can thus be considered "free" update steps. In this way, GeL can improve training speed without increasing computational burden and reduce the need for precise learning rate tuning. Specifically, the main contributions of GeL include: - Proposing a novel algorithm, GeL, which can compensate for the limitations of resource-constrained devices and system capability variations through "guess" updates. - Providing a convergence analysis of GeL based on existing theoretical frameworks and offering insights into the guess updates in GeL. - Experimental results show that GeL can significantly accelerate the convergence speed of various tasks, improving communication round efficiency by up to 30%. - The paper also demonstrates that GeL can be combined with existing federated learning algorithms such as FedProx, FedNova, and FedYogi to further enhance their performance. - Finally, the paper shows that GeL can achieve good performance even with unrefined learning rate settings, alleviating the need for learning rate tuning.