Abstract: The fixed-size context of the Transformer makes GPT models incapable of generating arbitrarily long text. In this paper, we introduce RecurrentGPT, a language-based simulacrum of the recurrence mechanism in RNNs. RecurrentGPT is built upon a large language model (LLM) such as ChatGPT and uses natural language to simulate the Long Short-Term Memory mechanism in an LSTM. At each timestep, RecurrentGPT generates a paragraph of text and updates its language-based long-term and short-term memories, which are stored on the hard drive and in the prompt, respectively. This recurrence mechanism enables RecurrentGPT to generate texts of arbitrary length without forgetting. Since human users can easily observe and edit the natural language memories, RecurrentGPT is interpretable and enables interactive generation of long text. RecurrentGPT is an initial step towards next-generation computer-assisted writing systems that go beyond local editing suggestions. In addition to producing AI-generated content (AIGC), we also demonstrate the possibility of using RecurrentGPT as interactive fiction that directly interacts with consumers. We call this usage of generative models "AI As Contents" (AIAC), which we believe is the next form of conventional AIGC. We further demonstrate the possibility of using RecurrentGPT to create personalized interactive fiction that interacts directly with readers rather than with writers. More broadly, RecurrentGPT demonstrates the utility of borrowing ideas from popular model designs in cognitive science and deep learning for prompting LLMs. Our code is available at <a class="link-external link-https" href="https://github.com/aiwaves-cn/RecurrentGPT" rel="external noopener nofollow">this https URL</a> and an online demo is available at <a class="link-external link-https" href="https://www.aiwaves.org/recurrentgpt" rel="external noopener nofollow">this https URL</a>.
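
The recurrence described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `RecurrentState`, `recurrent_step`, and `generate` are hypothetical, the `llm` argument is an assumed callable that returns a new paragraph together with an updated short-term memory, and an in-memory list stands in for the long-term memory that the abstract says is kept on the hard drive. The abstract does not say how long-term memory is read back, so the sketch simply reuses the most recent paragraph.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical type: an LLM wrapper that maps a prompt to
# (new paragraph, updated short-term memory). Not a real library API.
LLM = Callable[[str], Tuple[str, str]]


@dataclass
class RecurrentState:
    # Short-term memory: a natural-language summary carried in the prompt.
    short_term: str = ""
    # Long-term memory: past paragraphs. A list stands in for on-disk storage.
    long_term: List[str] = field(default_factory=list)


def recurrent_step(state: RecurrentState, llm: LLM) -> str:
    """Run one timestep: generate a paragraph, then update both memories."""
    # Assumption: read back only the latest paragraph from long-term memory.
    recent = state.long_term[-1] if state.long_term else ""
    prompt = (
        f"Short-term memory: {state.short_term}\n"
        f"Previous paragraph: {recent}\n"
        "Write the next paragraph and an updated short-term memory."
    )
    paragraph, new_short_term = llm(prompt)
    state.long_term.append(paragraph)   # long-term memory grows every step
    state.short_term = new_short_term   # short-term memory is overwritten
    return paragraph


def generate(llm: LLM, steps: int) -> List[str]:
    """Generate `steps` paragraphs by repeating the recurrent step."""
    state = RecurrentState()
    return [recurrent_step(state, llm) for _ in range(steps)]
```

Because both memories are plain strings, a user could inspect or edit `state.short_term` between steps, which is the property the abstract relies on for interactive generation.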

Non-iterative Parallel Text Generation via Glancing Transformer

Glancing Transformer for Non-Autoregressive Neural Machine Translation

Latent-Glat: Glancing at Latent Variables for Parallel Text Generation

Emage: Non-Autoregressive Text-to-Image Generation

Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning

Diff-Glat: Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning

On the Learning of Non-Autoregressive Transformers

Cascaded Text Generation with Markov Transformers

Accelerating Transformer Inference for Translation via Parallel Decoding

Non-Autoregressive Text Generation with Pre-trained Language Models

Unlocking the Power of GANs in Non-Autoregressive Text Generation

Non-autoregressive Transformer by Position Learning

Lossless Speedup of Autoregressive Translation with Generalized Aggressive Decoding

RecycleGPT: An Autoregressive Language Model with Recyclable Module

Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition

PNAT: Non-autoregressive Transformer by Position Learning

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

Fast Decoding in Sequence Models using Discrete Latent Variables

Semi-Autoregressive Neural Machine Translation

RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text