Evaluating and Enhancing the Robustness of Federated Learning System Against Realistic Data Corruption

Chen Yang,Yuanchun Li,Hao Lu,Jinliang Yuan,Qibo Sun,Shangguang Wang,Mengwei Xu
DOI: https://doi.org/10.1109/issre59848.2023.00050
2023-01-01
Abstract:Federated learning (FL) has emerged as a prominent paradigm enabling collaborative model training without transmitting local data, thereby safeguarding data privacy. However, the practical implementation of FL systems on these devices faces a significant challenge: the heterogeneous corruption of data on individual clients, leading to unanticipated accuracy degradation during real-world deployment. In this work, we first introduce a realistic data corruption simulation framework to test the robustness of FL systems. In this framework, an in-depth analysis of potential data corruption patterns occurring on devices is conducted, followed by the construction of individual datasets with varying corruption types and degrees. Such data corruption results in the robustness degradation of conventional FL protocol (FedAVG) significantly higher than centralized learning (CL). Atop this key observation, we propose an adaptive FL protocol that emulates the CL training process. The protocol leverages imbalanced client data sampling to mitigate the negative impact of data corruption. Furthermore, a hybrid aggregation strategy is designed to accelerate model convergence and reduce additional communication overhead. Extensive experiments validate the effectiveness of our approach in enhancing the robustness of FL systems against client data corruption, which achieves up to 12% higher converge accuracy than FedAVG-based systems with acceptable overhead.
What problem does this paper attempt to address?