Abstract:Federated Learning (FL) has emerged as a powerful paradigm for training Machine Learning (ML), particularly Deep Learning (DL) models on multiple devices or servers while maintaining data localized at owners' sites. Without centralizing data, FL holds promise for scenarios where data integrity, privacy and security and are critical. However, this decentralized training process also opens up new avenues for opponents to launch unique attacks, where it has been becoming an urgent need to understand the vulnerabilities and corresponding defense mechanisms from a learning algorithm perspective. This review paper takes a comprehensive look at malicious attacks against FL, categorizing them from new perspectives on attack origins and targets, and providing insights into their methodology and impact. In this survey, we focus on threat models targeting the learning process of FL systems. Based on the source and target of the attack, we categorize existing threat models into four types, Data to Model (D2M), Model to Data (M2D), Model to Model (M2M) and composite attacks. For each attack type, we discuss the defense strategies proposed, highlighting their effectiveness, assumptions and potential areas for improvement. Defense strategies have evolved from using a singular metric to excluding malicious clients, to employing a multifaceted approach examining client models at various phases. In this survey paper, our research indicates that the to-learn data, the learning gradients, and the learned model at different stages all can be manipulated to initiate malicious attacks that range from undermining model performance, reconstructing private local data, and to inserting backdoors. We have also seen these threat are becoming more insidious. While earlier studies typically amplified malicious gradients, recent endeavors subtly alter the least significant weights in local models to bypass defense measures. This literature review provides a holistic understanding of the current FL threat landscape and highlights the importance of developing robust, efficient, and privacy-preserving defenses to ensure the safe and trusted adoption of FL in real-world applications. The categorized bibliography can be found at: https://github.com/Rand2AI/Awesome-Vulnerability-of-Federated-Learning .

Robust Federated Learning Mitigates Client-side Training Data Distribution Inference Attacks

Privacy-Preserving Federated Learning Against Label-Flipping Attacks on Non-IID Data

Federated Learning Under Attack: Exposing Vulnerabilities through Data Poisoning Attacks in Computer Networks

Network-Level Adversaries in Federated Learning

Eavesdrop the Composition Proportion of Training Labels in Federated Learning

FedDefender: Client-Side Attack-Tolerant Federated Learning

Attacks on fairness in Federated Learning

Towards Understanding Adversarial Transferability in Federated Learning

A federated learning attack method based on edge collaboration via cloud

On the Vulnerability of Backdoor Defenses for Federated Learning

Data Poisoning Attacks Against Federated Learning Systems

Federated Learning With Unreliable Clients: Performance Analysis and Mechanism Design

Adaptive Selection of Loss Function for Federated Learning Clients Under Adversarial Attacks

Defending against gradient inversion attacks in federated learning via statistical machine unlearning

Defending Against Data Reconstruction Attacks in Federated Learning: An Information Theory Approach

Beyond Model Splitting: Preventing Label Inference Attacks in Vertical Federated Learning with Dispersed Training

A survey on vulnerability of federated learning: A learning algorithm perspective

A Four-Pronged Defense Against Byzantine Attacks in Federated Learning

Robust Federated Learning against both Data Heterogeneity and Poisoning Attack via Aggregation Optimization

Efficient, Private and Robust Federated Learning

FedTruth: Byzantine-Robust and Backdoor-Resilient Federated Learning Framework