Byzantine Fault-Tolerant Federated Learning Based on Trustworthy Data and Historical Information

Xujiang Luo,Bin Tang
DOI: https://doi.org/10.3390/electronics13081540
IF: 2.9
2024-04-19
Electronics
Abstract:Federated learning (FL) is a highly promising collaborative machine learning method that preserves privacy by enabling model training on client nodes (e.g., mobile phones, Internet-of-Things devices) without sharing raw data. However, FL is vulnerable to Byzantine nodes, which can disrupt model performance, render training ineffective, or even manipulate the model by transmitting harmful gradients. In this paper, we propose a Byzantine fault-tolerant FL algorithm called federated learning with trustworthy data and historical information (FLTH). It utilizes a small trusted training dataset at the parameter server to filter out gradient updates from suspicious client nodes during model training, which provides both Byzantine resilience and convergence guarantee. It further introduces a historical information-based credibility assessment scheme such that the client nodes performing poorly over the long-term have a lower impact on the aggregation of gradients, thereby enhancing fault tolerance capability. Additionally, FLTH does not compromise the training efficiency of FL because of its low time complexity. Extensive simulation results show that FLTH achieves higher model accuracy compared to state-of-the-art methods under typical kinds of attack.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to resist the influence of Byzantine nodes in Federated Learning (FL). Specifically, Federated Learning is a privacy - protected distributed machine - learning method that allows models to be trained on client nodes (such as mobile phones, Internet - of - Things devices) without sharing the original data. However, Federated Learning is vulnerable to the influence of Byzantine nodes, which may damage the model performance by sending harmful gradients, making the training ineffective and even manipulating the model. To solve this problem, the author proposes a Byzantine - fault - tolerant Federated Learning algorithm with Trustworthy Data and Historical Information (FLTH). The main features of this algorithm are as follows: 1. **Utilizing a trustworthy data set**: Maintain a small trustworthy training data set on the parameter server to filter out the gradient updates of suspicious client nodes, thereby providing Byzantine - fault - tolerance and convergence guarantees. 2. **Introducing a historical - information - based evaluation scheme**: By introducing a credibility - evaluation scheme based on historical information, the influence of client nodes with long - term poor performance on gradient aggregation is reduced, thereby enhancing the fault - tolerance ability. 3. **Low time complexity**: The time complexity of the FLTH algorithm is the same as that of the classic Federated Averaging (FedAvg) algorithm and does not significantly affect the training efficiency. The paper verifies the effectiveness of the FLTH algorithm through extensive simulation experiments. The results show that FLTH can achieve higher model accuracy under typical attacks and has better performance compared to the existing state - of - the - art methods. ### Main contributions - **Proposing the FLTH algorithm**: By combining trustworthy data and historical information, a new Byzantine - fault - tolerant Federated Learning method is provided. - **Low time complexity**: It is proved that the time complexity of the FLTH algorithm is \(O(nd)\) and does not significantly increase the computational overhead. - **Convergence analysis**: The convergence results of the FLTH algorithm are established under mild assumptions. - **Experimental verification**: Through experiments on the MNIST and CIFAR - 10 data sets, the effectiveness and superiority of the FLTH algorithm under multiple attacks are verified. ### Conclusion The FLTH algorithm effectively improves the Byzantine - fault - tolerance ability and model accuracy of the Federated Learning system by using trustworthy data and historical information while maintaining a low time complexity. This provides a new solution for the robustness and security of Federated Learning in practical applications.