Abstract: The unlearning problem of deep learning models, once primarily an academic concern, has become a prevalent issue in the industry. The significant advances in text-to-image generation techniques have prompted global discussions on privacy, copyright, and safety, as numerous unauthorized personal IDs, content, artistic creations, and potentially harmful materials have been learned by these models and later utilized to generate and distribute uncontrolled content. To address this challenge, we propose \textbf{Forget-Me-Not}, an efficient and low-cost solution designed to safely remove specified IDs, objects, or styles from a well-configured text-to-image model in as little as 30 seconds, without impairing its ability to generate other content. Alongside our method, we introduce the \textbf{Memorization Score (M-Score)} and \textbf{ConceptBench} to measure a model's capacity to generate general concepts, grouped into three primary categories: ID, object, and style. Using M-Score and ConceptBench, we demonstrate that Forget-Me-Not can effectively eliminate targeted concepts while maintaining the model's performance on other concepts. Furthermore, Forget-Me-Not offers two practical extensions: a) removal of potentially harmful or NSFW content, and b) enhancement of model accuracy, inclusion, and diversity through \textbf{concept correction and disentanglement}. It can also be adapted as a lightweight model patch for Stable Diffusion, allowing for concept manipulation and convenient distribution. To encourage future research in this critical area and promote the development of safe and inclusive generative models, we will open-source our code and ConceptBench at \href{https://github.com/SHI-Labs/Forget-Me-Not}{https://github.com/SHI-Labs/Forget-Me-Not}.

Learn to Forget: Memorization Elimination for Neural Networks.

Learn to Forget: Machine Unlearning Via Neuron Masking

FedME2: Memory Evaluation & Erase Promoting Federated Unlearning in DTMN

Measuring Forgetting of Memorized Training Examples

Active forgetting via influence estimation for neural networks

Measuring Catastrophic Forgetting in Neural Networks

Learning with Recoverable Forgetting

Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models

Can Bad Teaching Induce Forgetting? Unlearning in Deep Networks Using an Incompetent Teacher

Learning by Active Forgetting for Neural Networks

Memorization in deep learning: A survey

Machine Unlearning using Forgetting Neural Networks

Fortuitous Forgetting in Connectionist Networks

Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Frontier AI Models

Pseudo-Probability Unlearning: Towards Efficient and Privacy-Preserving Machine Unlearning

Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning

Memory Recall: A Simple Neural Network Training Framework Against Catastrophic Forgetting

Amnesiac Machine Learning

Deep Unlearning: Fast and Efficient Gradient-free Approach to Class Forgetting

Leveraging Unlabeled Data to Track Memorization

One-Shot Machine Unlearning with Mnemonic Code