Abstract:In this article, we investigate random client selection in the context of horizontal federated learning (FL), whereby only a randomly selected subset of clients transmit their model updates to the server instead of yielding all clients involved. Many researchers have demonstrated that clustering-based client selection constitutes a simple yet efficacious approach to the identification of those clients possessing representative gradient information. Despite the extensive body of research on modified selection methodologies, the majority of prior work is predicated upon the assumption of consistently effective clustering. However, raw gradient-based clustering methods are subject to several challenges: 1) poor effectiveness, the raw high-dimensional gradient of a client is too complex to serve as an appropriate feature for grouping, resulting in large intra-cluster distances and 2) fluctuating effectiveness, due to inherent limitations in clustering, the effectiveness can vary significantly, leading to clusters with diverse levels of heterogeneity. In practice, suboptimal and inconsistent clustering effects can result in clusters with low intra-cluster similarity among clients. The selection of clients from such clusters may impede the overall convergence of training. In this article, we propose, a novel client selection scheme to accelerate the FL convergence by variance reduction. The main idea of is to stratify a compressed model update in order to ensure an excellent grouping effect, and at the same time reduce the cross-client variance by re-allocating the sample chance among different groups based on their diverse heterogeneity. It strikes this convergence acceleration by paying more attention to those client groups with relatively low similarity and then improving the representativeness of the selected subset as much as possible. Theoretically, we demonstrate the critical improvement of the proposed scheme in variance reduction and present equivalence conditions among different client selection methods. We also present the tighter convergence guarantee of the proposed method thanks to the variance reduction. Experimental results confirm the exceeded efficiency of our approach compared to alternatives.

Node Selection Toward Faster Convergence for Federated Learning on Non-IID Data

FedPSE: Personalized Sparsification with Element-wise Aggregation for Federated Learning

Optimizing Federated Learning on Non-IID Data Using Local Shapley Value.

FedPD: A Federated Learning Framework with Optimal Rates and Adaptivity to Non-IID Data.

Accelerating Federated Learning by Selecting Beneficial Herd of Local Gradients

Enhancing Convergence in Federated Learning: A Contribution-Aware Asynchronous Approach

Hierarchical Federated Learning with Adaptive Clustering on Non-IID Data

Federated Learning on Non-Independent and Identically Distributed Data

FedPD: A Federated Learning Framework With Adaptivity to Non-IID Data

Preconditioned Federated Learning

FedAgg: Adaptive Federated Learning with Aggregated Gradients

FedPA: An adaptively partial model aggregation strategy in Federated Learning

Data Selection for Efficient Model Update in Federated Learning

FedLion: Faster Adaptive Federated Optimization with Fewer Communication

FedSTS: A Stratified Client Selection Framework for Consistently Fast Federated Learning

FedEP: Tailoring Attention to Heterogeneous Data Distribution with Entropy Pooling for Decentralized Federated Learning

Efficient Federated Learning via Local Adaptive Amended Optimizer with Linear Speedup

Stabilizing and Accelerating Federated Learning on Heterogeneous Data With Partial Client Participation

Enhancing Edge-Assisted Federated Learning with Asynchronous Aggregation and Cluster Pairing

Faster Convergence on Heterogeneous Federated Edge Learning: An Adaptive Clustered Data Sharing Approach

Achieving Linear Speedup with Partial Worker Participation in Non-IID Federated Learning