Abstract: Machine learning models trained on vast amounts of real or synthetic data often achieve outstanding predictive performance across various domains. However, this utility comes with growing privacy concerns, as the training data may include sensitive information. To address these concerns, machine unlearning has been proposed to erase specific data samples from trained models. While some unlearning techniques remove data efficiently and at low cost, recent research highlights vulnerabilities in which malicious users request unlearning of manipulated data to compromise the model. Although these attacks are effective, the perturbed data differs from the original training data and therefore fails hash verification. Existing attacks on machine unlearning also suffer from practical limitations and require substantial additional knowledge and resources. To fill these gaps, we introduce the Unlearning Usability Attack. This model-agnostic, unlearning-agnostic, and budget-friendly attack distills data distribution information into a small set of benign data. Automatic poisoning detection tools identify these data as benign because they have a positive impact on model training. Although the data are benign for machine learning, unlearning them significantly degrades the information retained by the model. Our evaluation demonstrates that unlearning this benign data, which comprises no more than 1% of the total training data, can reduce model accuracy by up to 50%. Furthermore, our findings show that well-prepared benign data poses challenges for recent unlearning techniques, as erasing these synthetic instances demands more resources than erasing regular data. These insights underscore the need for future research to reconsider "data poisoning" in the context of machine unlearning.
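The hash-verification point above can be illustrated with a minimal sketch. It assumes, beyond what the abstract states, that the service stores SHA-256 digests of the exact training samples and rejects any unlearning request whose digest it does not recognize. The names `fingerprint` and `UnlearningGate` and the toy byte strings are hypothetical and serve only as illustration.

```python
import hashlib


def fingerprint(sample: bytes) -> str:
    """Return the SHA-256 hex digest of a raw training sample."""
    return hashlib.sha256(sample).hexdigest()


class UnlearningGate:
    """Hypothetical request filter: accepts only samples seen during training."""

    def __init__(self, training_samples):
        # Assumption: the service records one digest per training sample.
        self._known = {fingerprint(s) for s in training_samples}

    def accepts(self, requested: bytes) -> bool:
        # A request passes only if its bytes match a training sample exactly.
        return fingerprint(requested) in self._known


# Toy data standing in for real training samples.
training = [b"sample-0", b"sample-1", b"sample-2"]
gate = UnlearningGate(training)

original_ok = gate.accepts(b"sample-1")             # unmodified training sample
perturbed_ok = gate.accepts(b"sample-1" + b"\x01")  # one extra byte of perturbation

print(original_ok, perturbed_ok)
```

Under this assumption, even a one-byte perturbation changes the digest, so a manipulated request is rejected, whereas data submitted unmodified at training time would pass the same check.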

Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

Unlearn and Burn: Adversarial Machine Unlearning Requests Destroy Model Accuracy

Deletion inference, reconstruction, and compliance in machine (un)learning

Gone but Not Forgotten: Improved Benchmarks for Machine Unlearning

Learn What You Want to Unlearn: Unlearning Inversion Attacks against Machine Unlearning

Conditional Matching GAN Guided Reconstruction Attack in Machine Unlearning

Releasing Malevolence from Benevolence: The Menace of Benign Data on Machine Unlearning

When Machine Unlearning Jeopardizes Privacy

Survey of Security and Data Attacks on Machine Unlearning In Financial and E-Commerce

Learn to Unlearn: A Survey on Machine Unlearning

Data Reconstruction Attacks and Defenses: A Systematic Evaluation

Backdoor Attacks via Machine Unlearning

When Machine Learning Models Leak: An Exploration of Synthetic Training Data

Adversarial Machine Unlearning

Verification of Machine Unlearning is Fragile

A Duty to Forget, a Right to Be Assured? Exposing Vulnerabilities in Machine Unlearning Services

Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning

Bounding Reconstruction Attack Success of Adversaries Without Data Priors

Game-Theoretic Machine Unlearning: Mitigating Extra Privacy Leakage

Threats, Attacks, and Defenses in Machine Unlearning: A Survey

Zero-shot Class Unlearning via Layer-wise Relevance Analysis and Neuronal Path Perturbation