Abstract:Federated learning (FL) is a highly promising collaborative machine learning method that preserves privacy by enabling model training on client nodes (e.g., mobile phones, Internet-of-Things devices) without sharing raw data. However, FL is vulnerable to Byzantine nodes, which can disrupt model performance, render training ineffective, or even manipulate the model by transmitting harmful gradients. In this paper, we propose a Byzantine fault-tolerant FL algorithm called federated learning with trustworthy data and historical information (FLTH). It utilizes a small trusted training dataset at the parameter server to filter out gradient updates from suspicious client nodes during model training, which provides both Byzantine resilience and convergence guarantee. It further introduces a historical information-based credibility assessment scheme such that the client nodes performing poorly over the long-term have a lower impact on the aggregation of gradients, thereby enhancing fault tolerance capability. Additionally, FLTH does not compromise the training efficiency of FL because of its low time complexity. Extensive simulation results show that FLTH achieves higher model accuracy compared to state-of-the-art methods under typical kinds of attack.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to resist the influence of Byzantine nodes in Federated Learning (FL). Specifically, Federated Learning is a privacy - protected distributed machine - learning method that allows models to be trained on client nodes (such as mobile phones, Internet - of - Things devices) without sharing the original data. However, Federated Learning is vulnerable to the influence of Byzantine nodes, which may damage the model performance by sending harmful gradients, making the training ineffective and even manipulating the model. To solve this problem, the author proposes a Byzantine - fault - tolerant Federated Learning algorithm with Trustworthy Data and Historical Information (FLTH). The main features of this algorithm are as follows: 1. **Utilizing a trustworthy data set**: Maintain a small trustworthy training data set on the parameter server to filter out the gradient updates of suspicious client nodes, thereby providing Byzantine - fault - tolerance and convergence guarantees. 2. **Introducing a historical - information - based evaluation scheme**: By introducing a credibility - evaluation scheme based on historical information, the influence of client nodes with long - term poor performance on gradient aggregation is reduced, thereby enhancing the fault - tolerance ability. 3. **Low time complexity**: The time complexity of the FLTH algorithm is the same as that of the classic Federated Averaging (FedAvg) algorithm and does not significantly affect the training efficiency. The paper verifies the effectiveness of the FLTH algorithm through extensive simulation experiments. The results show that FLTH can achieve higher model accuracy under typical attacks and has better performance compared to the existing state - of - the - art methods. ### Main contributions - **Proposing the FLTH algorithm**: By combining trustworthy data and historical information, a new Byzantine - fault - tolerant Federated Learning method is provided. - **Low time complexity**: It is proved that the time complexity of the FLTH algorithm is \(O(nd)\) and does not significantly increase the computational overhead. - **Convergence analysis**: The convergence results of the FLTH algorithm are established under mild assumptions. - **Experimental verification**: Through experiments on the MNIST and CIFAR - 10 data sets, the effectiveness and superiority of the FLTH algorithm under multiple attacks are verified. ### Conclusion The FLTH algorithm effectively improves the Byzantine - fault - tolerance ability and model accuracy of the Federated Learning system by using trustworthy data and historical information while maintaining a low time complexity. This provides a new solution for the robustness and security of Federated Learning in practical applications.

Byzantine Fault-Tolerant Federated Learning Based on Trustworthy Data and Historical Information

Secure and Efficient Decentralized Federated Learning with Data Representation Protection

TDFL: Truth Discovery Based Byzantine Robust Federated Learning

Byzantine detection for federated learning under highly non-IID data and majority corruptions

Byzantine-Resilient Secure Aggregation for Federated Learning Without Privacy Compromises

Trustworthy Federated Learning via Blockchain

Efficient, Private and Robust Federated Learning

FedTruth: Byzantine-Robust and Backdoor-Resilient Federated Learning Framework

Fair and Robust Federated Learning via Decentralized and Adaptive Aggregation based on Blockchain

BlockDFL: A Blockchain-based Fully Decentralized Peer-to-Peer Federated Learning Framework

A Four-Pronged Defense Against Byzantine Attacks in Federated Learning

Asynchronous Byzantine Federated Learning

Efficient Byzantine-Robust and Provably Privacy-Preserving Federated Learning

Byzantine-Robust Decentralized Federated Learning

Byzantine-robust Decentralized Federated Learning Via Dual-domain Clustering and Trust Bootstrapping

Byzantine-Robust Federated Learning with Variance Reduction and Differential Privacy

Advancing Hybrid Defense for Byzantine Attacks in Federated Learning

TPFL: A Trustworthy Personalized Federated Learning Framework via Subjective Logic

LoByITFL: Low Communication Secure and Private Federated Learning

SF-CABD: Secure Byzantine fault tolerance federated learning on Non-IID data

Brave: Byzantine-Resilient and Privacy-Preserving Peer-to-Peer Federated Learning