Abstract: Machine unlearning refers to the process of mitigating the influence of specific training data on machine learning models in response to removal requests from data owners. However, one important area that has been largely overlooked in unlearning research is reinforcement learning. Reinforcement learning focuses on training an agent to make optimal decisions within an environment so as to maximize its cumulative reward. During training, the agent tends to memorize features of the environment, which raises a significant privacy concern. Under data protection regulations, the owner of the environment holds the right to revoke access to the agent's training data, which necessitates a new and pressing research field, \emph{reinforcement unlearning}. Reinforcement unlearning focuses on revoking entire environments rather than individual data samples. This characteristic presents three distinct challenges: 1) how to design unlearning schemes for environments; 2) how to avoid degrading the agent's performance in the remaining environments; and 3) how to evaluate the effectiveness of unlearning. To tackle these challenges, we propose two reinforcement unlearning methods. The first is based on decremental reinforcement learning, which aims to gradually erase the agent's previously acquired knowledge. The second leverages environment poisoning attacks, which encourage the agent to learn new, albeit incorrect, knowledge so as to remove its knowledge of the unlearning environment. To tackle the third challenge, we introduce the concept of an ``environment inference attack'' to evaluate the unlearning outcomes. The source code is available at \url{https://anonymous.4open.science/r/Reinforcement-Unlearning-D347}.

Reinforcement Unlearning