Abstract: The unlearning problem for deep learning models, once primarily an academic concern, has become a prevalent issue in industry. Significant advances in text-to-image generation have prompted global discussions on privacy, copyright, and safety, because these models have learned numerous unauthorized personal IDs, content, artistic creations, and potentially harmful materials, which are later used to generate and distribute uncontrolled content. To address this challenge, we propose Forget-Me-Not, an efficient and low-cost solution designed to safely remove specified IDs, objects, or styles from a well-configured text-to-image model in as little as 30 seconds, without impairing its ability to generate other content. Alongside our method, we introduce the Memorization Score (M-Score) and ConceptBench to measure a model's capacity to generate general concepts, grouped into three primary categories: ID, object, and style. Using M-Score and ConceptBench, we demonstrate that Forget-Me-Not can effectively eliminate targeted concepts while maintaining the model's performance on other concepts. Furthermore, Forget-Me-Not offers two practical extensions: a) removal of potentially harmful or NSFW content, and b) enhancement of model accuracy, inclusion, and diversity through concept correction and disentanglement. It can also be adapted as a lightweight model patch for Stable Diffusion, allowing for concept manipulation and convenient distribution. To encourage future research in this critical area and promote the development of safe and inclusive generative models, we will open-source our code and ConceptBench at https://github.com/SHI-Labs/Forget-Me-Not.

Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge

Unlearning Concepts from Text-to-Video Diffusion Models

Circumventing Concept Erasure Methods For Text-to-Image Generative Models

Memories of Forgotten Concepts

Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

Erasing Concepts from Diffusion Models

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Ablating Concepts in Text-to-Image Diffusion Models

All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models

Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models

Removing Undesirable Concepts in Text-to-Image Diffusion Models with Learnable Prompts

Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts

Separable Multi-Concept Erasure from Diffusion Models

MACE: Mass Concept Erasure in Diffusion Models

Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models

Robust Concept Erasure Using Task Vectors