An Element-Wise Weights Aggregation Method for Federated Learning

Yi Hu,Hanchi Ren,Chen Hu,Jingjing Deng,Xianghua Xie
DOI: https://doi.org/10.1109/ICDMW60847.2023.00031
2024-04-24
Abstract:Federated learning (FL) is a powerful Machine Learning (ML) paradigm that enables distributed clients to collaboratively learn a shared global model while keeping the data on the original device, thereby preserving privacy. A central challenge in FL is the effective aggregation of local model weights from disparate and potentially unbalanced participating clients. Existing methods often treat each client indiscriminately, applying a single proportion to the entire local model. However, it is empirically advantageous for each weight to be assigned a specific proportion. This paper introduces an innovative Element-Wise Weights Aggregation Method for Federated Learning (EWWA-FL) aimed at optimizing learning performance and accelerating convergence speed. Unlike traditional FL approaches, EWWA-FL aggregates local weights to the global model at the level of individual elements, thereby allowing each participating client to make element-wise contributions to the learning process. By taking into account the unique dataset characteristics of each client, EWWA-FL enhances the robustness of the global model to different datasets while also achieving rapid convergence. The method is flexible enough to employ various weighting strategies. Through comprehensive experiments, we demonstrate the advanced capabilities of EWWA-FL, showing significant improvements in both accuracy and convergence speed across a range of backbones and benchmarks.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve a core challenge in federated learning (FL): how to effectively aggregate local model weights from different clients. Specifically, the paper focuses on the following issues: 1. **Heterogeneity of data distribution**: In federated learning, each client's data set may have different statistical characteristics, resulting in non - independent and identically distributed (Non - IID) data distribution. This heterogeneity complicates the aggregation of local model weights because local models of different clients may capture different data patterns. 2. **Limitations of existing methods**: Existing federated learning methods usually perform uniform proportional aggregation on the local model weights of each client, that is, apply the same aggregation proportion to the entire model. This method ignores the difference in the contribution of each parameter on different data sets and may lead to poor performance of the global model. 3. **Optimizing learning performance and accelerating convergence speed**: The paper proposes a new element - wise weights aggregation method for federated learning (EWWA - FL), aiming to optimize learning performance and accelerate convergence speed by assigning a specific proportion to each parameter. ### Specific solutions The core ideas of the EWWA - FL method are: - **Element - level aggregation**: Different from traditional model - level aggregation, EWWA - FL aggregates local model weights at the element level. This means that each parameter will obtain a specific aggregation proportion according to its performance in local training. - **Adaptive aggregation**: By considering the unique characteristics of each client's data set, EWWA - FL can handle the contributions of different data sets more flexibly, thereby enhancing the robustness and generalization ability of the global model. - **Experimental verification**: Through extensive experiments, the paper shows the superior performance of EWWA - FL on multiple neural network architectures and benchmark data sets, especially showing significant improvement on non - independent and identically distributed (Non - IID) data sets. ### Experimental results The experimental results show that EWWA - FL achieves better performance than existing methods on a variety of neural network architectures and benchmark data sets, especially when dealing with large - scale category data sets such as CIFAR - 100 and ILSVRC2012. For example, when using the ResNet - 20 architecture to process the CIFAR - 100 data set, the accuracy rate of EWWA - FL reaches 64.16%, which is 9.53% higher than that of the best existing method FedAvg. In conclusion, this paper effectively solves the problems of data distribution heterogeneity and limitations of existing methods in federated learning by introducing the EWWA - FL method, and significantly improves learning performance and convergence speed.