Abstract:The training of modern large language models (LLMs) takes place in a regime where most training examples are seen only a few times by the model during the course of training. What does a model remember about such examples seen only a few times during training and how long does that memory persist in the face of continuous training with new examples? Here, we investigate these questions through simple recognition, recall, and retention experiments with LLMs. In recognition experiments, we ask if the model can distinguish the seen example from a novel example; in recall experiments, we ask if the model can correctly recall the seen example when cued by a part of it; and in retention experiments, we periodically probe the model's memory for the original examples as the model is trained continuously with new examples. We find that a single exposure is generally sufficient for a model to achieve near perfect accuracy even in very challenging recognition experiments. We estimate that the recognition performance of even small language models easily exceeds human recognition performance reported in similar experiments with humans (Shepard, 1967). Achieving near perfect recall takes more exposures, but most models can do it in just 3 exposures. The flip side of this remarkable capacity for fast learning is that precise memories are quickly overwritten: recall performance for the original examples drops steeply over the first 10 training updates with new examples, followed by a more gradual decline. Even after 100K updates, however, some of the original examples are still recalled near perfectly. A qualitatively similar retention pattern has been observed in human long-term memory retention studies before (Bahrick, 1984). Finally, recognition is much more robust to interference than recall and memory for natural language sentences is generally superior to memory for stimuli without structure.

Demystifying Verbatim Memorization in Large Language Models

Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications

Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation

SoK: Memorization in General-Purpose Large Language Models

Emergent and Predictable Memorization in Large Language Models

Detecting Memorization in Large Language Models

Undesirable Memorization in Large Language Models: A Survey

Preventing Verbatim Memorization in Language Models Gives a False Sense of Privacy

A Multi-Perspective Analysis of Memorization in Large Language Models

Quantifying and Analyzing Entity-level Memorization in Large Language Models

Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Continual Memorization of Factoids in Large Language Models

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models

Predicting and analyzing memorization within fine-tuned Large Language Models

To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models

Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Rethinking LLM Memorization through the Lens of Adversarial Compression

Recognition, recall, and retention of few-shot memories in large language models

Are Large Language Models Memorizing Bug Benchmarks?