Robust and Scalable Model Editing for Large Language Models

Yingfa Chen,Zhengyan Zhang,Xu Han,Chaojun Xiao,Zhiyuan Liu,Chen Chen,Kuai Li,Tao Yang,Maosong Sun

2024-03-26

Abstract:Large language models (LLMs) can make predictions using parametric knowledge--knowledge encoded in the model weights--or contextual knowledge--knowledge presented in the context. In many scenarios, a desirable behavior is that LLMs give precedence to contextual knowledge when it conflicts with the parametric knowledge, and fall back to using their parametric knowledge when the context is irrelevant. This enables updating and correcting the model's knowledge by in-context editing instead of retraining. Previous works have shown that LLMs are inclined to ignore contextual knowledge and fail to reliably fall back to parametric knowledge when presented with irrelevant context. In this work, we discover that, with proper prompting methods, instruction-finetuned LLMs can be highly controllable by contextual knowledge and robust to irrelevant context. Utilizing this feature, we propose EREN (Edit models by REading Notes) to improve the scalability and robustness of LLM editing. To better evaluate the robustness of model editors, we collect a new dataset, that contains irrelevant questions that are more challenging than the ones in existing datasets. Empirical results show that our method outperforms current state-of-the-art methods by a large margin. Unlike existing techniques, it can integrate knowledge from multiple edits, and correctly respond to syntactically similar but semantically unrelated inputs (and vice versa). The source code can be found at

Computation and Language,Machine Learning

What problem does this paper attempt to address?

The problem addressed in the paper is how to prioritize the use of contextual knowledge over parameter knowledge in large language models (LLMs), and fallback to parameter knowledge when the context is not relevant. Existing methods suffer from a lack of utilization of context, inability to reliably fallback to parameter knowledge, and sensitivity to irrelevant context. The paper proposes the EREN (Editing via Reading Executable Notes) method, which enhances the scalability and robustness of editing through memory-based editing and addresses the challenges of dealing with a large number of edits and irrelevant edits. Additionally, they create a new dataset to evaluate the robustness of model editors more accurately. Experimental results demonstrate that EREN outperforms current state-of-the-art methods in model editing tasks.

Robust and Scalable Model Editing for Large Language Models

On the Robustness of Editing Large Language Models

Editing Large Language Models: Problems, Methods, and Opportunities

Keys to Robust Edits: from Theoretical Insights to Practical Advances

Is it Possible to Edit Large Language Models Robustly?

Model Editing for LLMs4Code: How Far are We?

A Comprehensive Study of Knowledge Editing for Large Language Models

MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Cross-Lingual Knowledge Editing in Large Language Models

InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning

Editing Conceptual Knowledge for Large Language Models

ELDER: Enhancing Lifelong Model Editing with Mixture-of-LoRA

Language Anisotropic Cross-Lingual Model Editing

Uncovering Overfitting in Large Language Model Editing

Memory-Based Model Editing at Scale