Abstract:Federated learning (FL) has attracted widespread attention in the Internet of Things domain recently. With FL, multiple distributed devices can cooperatively train a global model by transmitting model updates without disclosing the original data. However, the distributed nature of FL makes it vulnerable to data poisoning attacks. In practice, malicious clients can launch the label-flipping attack (LFA) by simply tampering with the labels of local data, thus causing the global model to misclassify the samples of a selected class as the target class. Although some defense mechanisms have been proposed, they rely on specific assumptions about data distribution, and their performance degrades significantly when the data on clients are non-IID. Besides, most existing methods require clients to upload model updates in plaintext so that the server can identify and remove the malicious updates. But, direct transmission of model updates may still reveal private information. Considering these issues, we develop a label-flipping-robust and privacy-preserving FL (LFR-PPFL) algorithm, which is applicable to both independent and identically distributed (IID) and non-IID data. We first propose a detection method based on temporal analysis on cosine similarity to distinguish malicious clients from benign clients. Then, we propose a privacy-preserving computation protocol based on homomorphic encryption to implement this detection method and perform federated aggregation while protecting the privacy of clients. Besides, a detailed theoretical analysis is given to demonstrate the privacy guarantee of the proposed protocol. Experimental results on real-world data sets show that the proposed algorithm can effectively defend against LFAs under various data distributions.

Federated Learning with Extreme Label Skew: A Data Extension Approach

Federated Learning with Label Distribution Skew via Logits Calibration.

FediOS: Decoupling Orthogonal Subspaces for Personalization in Feature-skew Federated Learning

FLea: Addressing Data Scarcity and Label Skew in Federated Learning via Privacy-preserving Feature Augmentation

Privacy-Preserving Federated Learning Against Label-Flipping Attacks on Non-IID Data

Optimizing Federated Learning on Non-IID Data Using Local Shapley Value.

Stabilizing and Improving Federated Learning with Non-IID Data and Client Dropout

CalFAT: Calibrated Federated Adversarial Training with Label Skewness

FedRS: Federated Learning with Restricted Softmax for Label Distribution Non-IID Data

Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

Federated Learning for distribution skewed data using sample weights

Federated Learning with Label-Masking Distillation

A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning

Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity

Tackling Feature Skew in Heterogeneous Federated Learning with Semantic Enhancement

An Aggregation-Free Federated Learning for Tackling Data Heterogeneity

Federated learning on non-IID and long-tailed data via dual-decoupling

Enhancing Federated Learning Convergence with Dynamic Data Queue and Data Entropy-driven Participant Selection

A Joint Training-Calibration Framework for Test-Time Personalization with Label Shift in Federated Learning

FedSLD: Federated Learning with Shared Label Distribution for Medical Image Classification