Federated Unlearning: A Survey on Methods, Design Guidelines, and Evaluation Metrics

Nicolò Romandini,Alessio Mora,Carlo Mazzocca,Rebecca Montanari,Paolo Bellavista
2024-02-16
Abstract:Federated Learning (FL) enables collaborative training of a Machine Learning (ML) model across multiple parties, facilitating the preservation of users' and institutions' privacy by keeping data stored locally. Instead of centralizing raw data, FL exchanges locally refined model parameters to build a global model incrementally. While FL is more compliant with emerging regulations such as the European General Data Protection Regulation (GDPR), ensuring the right to be forgotten in this context - allowing FL participants to remove their data contributions from the learned model - remains unclear. In addition, it is recognized that malicious clients may inject backdoors into the global model through updates, e.g. to generate mispredictions on specially crafted data examples. Consequently, there is the need for mechanisms that can guarantee individuals the possibility to remove their data and erase malicious contributions even after aggregation, without compromising the already acquired "good" knowledge. This highlights the necessity for novel Federated Unlearning (FU) algorithms, which can efficiently remove specific clients' contributions without full model retraining. This survey provides background concepts, empirical evidence, and practical guidelines to design/implement efficient FU schemes. Our study includes a detailed analysis of the metrics for evaluating unlearning in FL and presents an in-depth literature review categorizing state-of-the-art FU contributions under a novel taxonomy. Finally, we outline the most relevant and still open technical challenges, by identifying the most promising research directions in the field.
Machine Learning,Cryptography and Security
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper primarily explores the issue of implementing the "right to be forgotten" in the context of Federated Learning (FL) and proposes a method called Federated Unlearning (FU) to address this challenge. #### Core Issues - **How to implement the "right to be forgotten" in an FL environment**: Even though data remains locally stored and is not directly shared in FL, ensuring that users can delete their contributions to the global model remains an unresolved issue. This is particularly important under regulations such as the General Data Protection Regulation (GDPR) in Europe and the California Consumer Privacy Act (CCPA). - **Removal of malicious contributions**: During the FL process, malicious clients may inject backdoors through updates, affecting the performance of the global model. Therefore, a mechanism is needed to remove these malicious contributions without destroying the valid knowledge already obtained. - **Avoiding retraining**: In an FL environment, retraining the entire model to remove the data contributions of specific clients is impractical, as it not only consumes a lot of resources but also becomes impossible when clients are no longer available. #### Experimental Validation - Researchers demonstrated through experiments the performance differences of the global model at different training rounds, proving that even if clients exit the training, their data contributions remain in the global model. Specifically, the global model's performance on test data is similar whether or not it includes specific clients, but it shows significantly higher accuracy on the training data of those clients. #### Main Objectives - Propose the FU algorithm, which can effectively remove the data contributions of specific clients without retraining the global model. - Design efficient and implementable FU schemes, including evaluation metrics and technical guidelines. - Review existing FU methods and provide a detailed analysis based on a novel classification system. - Identify current technical challenges and propose future research directions.