Abstract:The fixed-size context of Transformer makes GPT models incapable of generating arbitrarily long text. In this paper, we introduce RecurrentGPT, a language-based simulacrum of the recurrence mechanism in RNNs. RecurrentGPT is built upon a large language model (LLM) such as ChatGPT and uses natural language to simulate the Long Short-Term Memory mechanism in an LSTM. At each timestep, RecurrentGPT generates a paragraph of text and updates its language-based long-short term memory stored on the hard drive and the prompt, respectively. This recurrence mechanism enables RecurrentGPT to generate texts of arbitrary length without forgetting. Since human users can easily observe and edit the natural language memories, RecurrentGPT is interpretable and enables interactive generation of long text. RecurrentGPT is an initial step towards next-generation computer-assisted writing systems beyond local editing suggestions. In addition to producing AI-generated content (AIGC), we also demonstrate the possibility of using RecurrentGPT as an interactive fiction that directly interacts with consumers. We call this usage of generative models by ``AI As Contents'' (AIAC), which we believe is the next form of conventional AIGC. We further demonstrate the possibility of using RecurrentGPT to create personalized interactive fiction that directly interacts with readers instead of interacting with writers. More broadly, RecurrentGPT demonstrates the utility of borrowing ideas from popular model designs in cognitive science and deep learning for prompting LLMs. Our code is available at <a class="link-external link-https" href="https://github.com/aiwaves-cn/RecurrentGPT" rel="external noopener nofollow">this https URL</a> and an online demo is available at <a class="link-external link-https" href="https://www.aiwaves.org/recurrentgpt" rel="external noopener nofollow">this https URL</a>.

Fixed global memory for controllable long text generation

GMAT: Global Memory Augmentation for Transformers

MemLong: Memory-Augmented Retrieval for Long Text Modeling

Augmenting Language Models with Long-Term Memory

Global memory transformer for processing long documents

With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation

LaMemo: Language Modeling with Look-Ahead Memory

Generalizable Memory-driven Transformer for Multivariate Long Sequence Time-series Forecasting

CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory

HMT: Hierarchical Memory Transformer for Long Context Language Processing

MEMORYLLM: Towards Self-Updatable Large Language Models

Memory-Augmenting Decoder-Only Language Models through Encoders (Student Abstract)

Extended Mind Transformers

Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Memory-Augmented Generative Adversarial Transformers

Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUs

RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation

Linearizing Transformer with Key-Value Memory