Abstract:The advent of Large Language Models (LLMs) has revolutionized natural language processing, enabling advanced understanding and reasoning capabilities across a variety of tasks. Fine-tuning these models for specific domains, particularly through Parameter-Efficient Fine-Tuning (PEFT) strategies like LoRA, has become a prevalent practice due to its efficiency. However, this raises significant privacy and security concerns, as models may inadvertently retain and disseminate sensitive or undesirable information. To address these issues, we introduce a novel instance-wise unlearning framework, LLMEraser, which systematically categorizes unlearning tasks and applies precise parameter adjustments using influence functions. Unlike traditional unlearning techniques that are often limited in scope and require extensive retraining, LLMEraser is designed to handle a broad spectrum of unlearning tasks without compromising model performance. Extensive experiments on benchmark datasets demonstrate that LLMEraser excels in efficiently managing various unlearning scenarios while maintaining the overall integrity and efficacy of the models.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is that when large - language models (LLMs) are fine - tuned in specific domains through parameter - efficient fine - tuning (PEFT) strategies such as LoRA, they may inadvertently retain and spread sensitive or unwanted information, thus causing privacy and security issues. To address these issues, the authors introduce a new instance - level forgetting framework - LLMEraser. ### Specific Problems and Solutions 1. **Privacy and Security Issues** - **Problem**: LLMs may remember and spread sensitive, biased, copyrighted, or harmful information during the fine - tuning process. - **Solution**: LLMEraser systematically classifies forgetting tasks and uses influence functions to precisely adjust parameters, ensuring that specific data can be removed without affecting the overall performance of the model. 2. **Limitations of Existing Methods** - **Problem**: Traditional forgetting techniques usually require a large amount of retraining and have limited effectiveness when dealing with complex instance - level forgetting tasks. - **Solution**: LLMEraser is designed to handle a wide range of forgetting tasks without the need for retraining, thus improving efficiency and reducing resource consumption. 3. **Instance - Level Forgetting Tasks** - **Problem**: Existing forgetting methods mainly focus on removing specific concepts, which are likely to affect related concepts. - **Solution**: LLMEraser expands the scope of forgetting tasks, including instance removal (IR), query modification (QM), and response correction (RC), allowing for more fine - grained control of model behavior. ### Technical Implementation - **Application of Influence Functions**: LLMEraser uses influence functions to directly calculate parameter changes, avoiding the time - consuming fine - tuning or retraining process. - **Optimization Algorithm**: To efficiently calculate the inverse Hessian - vector product, LLMEraser transforms it into a finite - sum quadratic programming problem, significantly reducing the computational complexity and the error caused by random estimation. ### Experimental Verification The authors verified the effectiveness of LLMEraser through multiple experiments, including data removal, query modification, and response correction tasks at different ratios. The experimental results show that LLMEraser performs excellently in various forgetting tasks and can effectively remove the influence of specific data while maintaining the overall performance of the model. ### Summary LLMEraser provides a general and efficient parameter - efficient forgetting framework that can effectively handle various instance - level forgetting tasks without sacrificing model performance, solving the privacy and security issues that LLMs may cause during the fine - tuning process.

Unified Parameter-Efficient Unlearning for LLMs

LMEraser: Large Model Unlearning through Adaptive Prompt Tuning

Unlearn What You Want to Forget: Efficient Unlearning for LLMs

ULMR: Unlearning Large Language Models Via Negative Response and Model Parameter Average

A Closer Look at Machine Unlearning for Large Language Models

Exact and Efficient Unlearning for Large Language Model-based Recommendation

Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference

LoRA Unlearns More and Retains More (Student Abstract)

Practical Unlearning for Large Language Models

UNLEARN Efficient Removal of Knowledge in Large Language Models

MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts

Soft Prompting for Unlearning in Large Language Models

DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model

Large Language Model Unlearning via Embedding-Corrupted Prompts

Rethinking Machine Unlearning for Large Language Models

LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models

Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models

Machine Unlearning in Large Language Models

LLM Unlearning via Loss Adjustment with Only Forget Data

Machine Unlearning of Pre-trained Large Language Models