FedCAP: Robust Federated Learning via Customized Aggregation and Personalization

Youpeng Li,Xinda Wang,Fuxun Yu,Lichao Sun,Wenbin Zhang,Xuyu Wang
2024-10-17
Abstract:Federated learning (FL), an emerging distributed machine learning paradigm, has been applied to various privacy-preserving scenarios. However, due to its distributed nature, FL faces two key issues: the non-independent and identical distribution (non-IID) of user data and vulnerability to Byzantine threats. To address these challenges, in this paper, we propose FedCAP, a robust FL framework against both data heterogeneity and Byzantine attacks. The core of FedCAP is a model update calibration mechanism to help a server capture the differences in the direction and magnitude of model updates among clients. Furthermore, we design a customized model aggregation rule that facilitates collaborative training among similar clients while accelerating the model deterioration of malicious clients. With a Euclidean norm-based anomaly detection mechanism, the server can quickly identify and permanently remove malicious clients. Moreover, the impact of data heterogeneity and Byzantine attacks can be further mitigated through personalization on the client side. We conduct extensive experiments, comparing multiple state-of-the-art baselines, to demonstrate that FedCAP performs well in several non-IID settings and shows strong robustness under a series of poisoning attacks.
Machine Learning,Artificial Intelligence,Cryptography and Security
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to deal with data heterogeneity and Byzantine attacks simultaneously in Federated Learning (FL). Specifically, the paper points out that Federated Learning faces two key challenges: 1. **Data Heterogeneity**: In practical applications, the data distributions of different clients may be very different (non - IID), which makes it very difficult to train a single global model to meet the needs of all clients. For example, in Google's next - word prediction task, due to the diversity of users' language habits and regional cultures, training a general - purpose model is very challenging. 2. **Byzantine Threats**: Federated Learning systems are vulnerable to attacks by malicious clients. These clients may upload arbitrary model updates, which can seriously degrade the performance of the model. This type of attack is called an untargeted poisoning attack. To address these two challenges, the paper proposes FedCAP (Federated Learning via Customized Aggregation and Personalization), a robust Federated Learning framework aimed at improving the performance of the model under data heterogeneity and Byzantine attacks. ### Main Contributions 1. **Proposing FedCAP**: A robust Federated Learning framework that can adapt to various non - IID settings and different types of attacks. 2. **Model Update Calibration Mechanism**: By calibrating the uploaded model updates, effectively capture the differences in direction and magnitude between benign clients and malicious clients. 3. **Custom - made Model Aggregation Rules**: Promote collaboration among similar clients, while accelerating the deterioration of the models of malicious clients, helping the server to identify and permanently remove malicious clients. 4. **Extensive Experimental Verification**: The experimental results show that FedCAP outperforms the existing state - of - the - art (SOTA) Federated Learning baseline methods in terms of model accuracy and robustness. ### Key Components of the Solution 1. **Model Calibration Mechanism**: By calibrating the uploaded model updates, help the server distinguish between malicious and benign updates in non - IID settings. 2. **Custom - made Aggregation Rules**: According to the similarity relationships between clients, assign a customized model to each client to better match its data distribution. 3. **Anomaly Detection Mechanism**: Based on the Euclidean norm of the calibrated model updates, quickly identify and permanently remove malicious clients. 4. **Personalized Training Module**: Further mitigate the effects of data heterogeneity and attacks by conducting personalized training on the client side. ### Experimental Results The paper verifies the effectiveness of FedCAP through extensive experiments. In particular, under different non - IID settings and multiple types of attacks, FedCAP performs excellently and significantly improves the model's accuracy and robustness. ### Conclusion By combining model calibration, custom - made aggregation, and anomaly detection mechanisms, FedCAP effectively solves the problems of data heterogeneity and Byzantine attacks in Federated Learning, providing a powerful solution for Federated Learning in practical applications.