Abstract: Machine learning models trained on vast amounts of real or synthetic data often achieve outstanding predictive performance across various domains. However, this utility comes with growing privacy concerns, as the training data may include sensitive information. To address these concerns, machine unlearning has been proposed to erase specific data samples from models. While some unlearning techniques remove data efficiently and at low cost, recent research highlights vulnerabilities where malicious users could request unlearning on manipulated data to compromise the model. Although these attacks are effective, the perturbed data differs from the original training data and therefore fails hash verification. Existing attacks on machine unlearning also suffer from practical limitations and require substantial additional knowledge and resources. To fill these gaps in current unlearning attacks, we introduce the Unlearning Usability Attack. This model-agnostic, unlearning-agnostic, and budget-friendly attack distills data distribution information into a small set of benign data. Automatic poisoning detection tools identify these data as benign because of their positive impact on model training. Although the data are benign for machine learning, unlearning them significantly degrades model information. Our evaluation demonstrates that unlearning this benign data, which comprises no more than 1% of the total training data, can reduce model accuracy by up to 50%. Furthermore, our findings show that well-prepared benign data poses challenges for recent unlearning techniques, as erasing these synthetic instances demands more resources than erasing regular data. These insights underscore the need for future research to reconsider "data poisoning" in the context of machine unlearning.

Privacy-Preserving Debiasing using Data Augmentation and Machine Unlearning

Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning

When Machine Unlearning Jeopardizes Privacy

Gone but Not Forgotten: Improved Benchmarks for Machine Unlearning

Learn to Unlearn: A Survey on Machine Unlearning

Fair Machine Unlearning: Data Removal while Mitigating Disparities

Model Debiasing by Learnable Data Augmentation

Releasing Malevolence from Benevolence: The Menace of Benign Data on Machine Unlearning

Game-Theoretic Machine Unlearning: Mitigating Extra Privacy Leakage

Pseudo-Probability Unlearning: Towards Efficient and Privacy-Preserving Machine Unlearning

Enhancing Privacy Protection for Online Learning Resource Recommendation with Machine Unlearning

Fed-AugMix: Balancing Privacy and Utility via Data Augmentation

How Does Data Augmentation Affect Privacy in Machine Learning?

Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage

Machine unlearning through fine-grained model parameters perturbation

Langevin Unlearning: A New Perspective of Noisy Gradient Descent for Machine Unlearning

Learn What You Want to Unlearn: Unlearning Inversion Attacks against Machine Unlearning

Privacy at a Price: Exploring its Dual Impact on AI Fairness

Privacy-preserving Machine Learning through Data Obfuscation

Adversarial Machine Unlearning

Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning