Abstract:Over the past decades, the abundance of personal data has led to the rapid development of machine learning models and important advances in artificial intelligence (AI). However, alongside all the achievements, there are increasing privacy threats and security risks that may cause significant losses for data providers. Recent legislation requires that the private information about a user should be removed from a database as well as machine learning models upon certain deletion requests. While erasing data records from memory storage is straightforward, it is often challenging to remove the influence of particular data samples from a model that has already been trained. Machine unlearning is an emerging paradigm that aims to make machine learning models "forget" what they have learned about particular data. Nevertheless, the unlearning issue for federated learning has not been completely addressed due to its special working mode. First, existing solutions crucially rely on retraining-based model calibration, which is likely unavailable and can pose new privacy risks for federated learning frameworks. Second, today's efficient unlearning strategies are mainly designed for convex problems, which are incapable of handling more complicated learning tasks like neural networks. To overcome these limitations, we took advantage of differential privacy and developed an efficient machine unlearning algorithm named FedRecovery. The FedRecovery erases the impact of a client by removing a weighted sum of gradient residuals from the global model, and tailors the Gaussian noise to make the unlearned model and retrained model statistically indistinguishable. Furthermore, the algorithm neither requires retraining-based fine-tuning nor needs the assumption of convexity. Theoretical analyses show the rigorous indistinguishability guarantee. Additionally, the experiment results on real-world datasets demonstrate that the FedRecovery is efficient and is able to produce a model that performs similarly to the retrained one.

To Be Forgotten or To Be Fair: Unveiling Fairness Implications of Machine Unlearning Methods

Fair Machine Unlearning: Data Removal while Mitigating Disparities

Exploring Fairness in Educational Data Mining in the Context of the Right to be Forgotten

A Duty to Forget, a Right to Be Assured? Exposing Vulnerabilities in Machine Unlearning Services

Debiasing Machine Unlearning with Counterfactual Examples

Learn to Unlearn: A Survey on Machine Unlearning

From Machine Learning to Machine Unlearning: Complying with GDPR's Right to be Forgotten while Maintaining Business Value of Predictive Models

An Overview of Machine Unlearning

Machine Unlearning: A Comprehensive Survey

Learn to Forget: Machine Unlearning Via Neuron Masking

Gone but Not Forgotten: Improved Benchmarks for Machine Unlearning

Machine Unlearning: Taxonomy, Metrics, Applications, Challenges, and Prospects

The Right to Be Forgotten in Federated Learning: an Efficient Realization with Rapid Retraining

A Survey of Machine Unlearning

Machine Unlearning in Forgettability Sequence

When Machine Unlearning Jeopardizes Privacy

Pseudo-Probability Unlearning: Towards Efficient and Privacy-Preserving Machine Unlearning

FedRecovery: Differentially Private Machine Unlearning for Federated Learning Frameworks.

Why Fine-Tuning Struggles with Forgetting in Machine Unlearning? Theoretical Insights and a Remedial Approach

Random Relabeling for Efficient Machine Unlearning

An Information Theoretic Approach to Machine Unlearning