Abstract: Today, computer systems hold large amounts of personal data. Yet while such an abundance of data allows breakthroughs in artificial intelligence, and especially machine learning (ML), its existence can be a threat to user privacy, and it can weaken the bonds of trust between humans and AI. Recent regulations now require that, on request, private information about a user must be removed both from computer systems and from ML models (i.e., ``the right to be forgotten''). While removing data from back-end databases should be straightforward, it is not sufficient in the AI context, as ML models often `remember' the old data. Contemporary adversarial attacks on trained models have shown that one can infer whether an instance or an attribute belonged to the training data. This phenomenon calls for a new paradigm, namely machine unlearning, to make ML models forget particular data. Recent works on machine unlearning have not been able to completely solve the problem due to the lack of common frameworks and resources. Therefore, this paper aspires to present a comprehensive examination of machine unlearning's concepts, scenarios, methods, and applications. Specifically, as a categorized collection of cutting-edge studies, this article is intended to serve as a comprehensive resource for researchers and practitioners seeking an introduction to machine unlearning and its formulations, design criteria, removal requests, algorithms, and applications. In addition, we aim to highlight the key findings, current trends, and new research areas that have not yet featured the use of machine unlearning but could benefit greatly from it. We hope this survey serves as a valuable resource for ML researchers and those seeking to innovate privacy technologies. Our resources are publicly available at https://github.com/tamlhp/awesome-machine-unlearning.
