Abstract:Lifelong learning (LL) is vital for advanced task-oriented dialogue (ToD) systems. To address the catastrophic forgetting issue of LL, generative replay methods are widely employed to consolidate past knowledge with generated pseudo samples. However, most existing generative replay methods use only a single task-specific token to control their models. This scheme is usually not strong enough to constrain the generative model due to insufficient information involved. In this paper, we propose a novel method, prompt conditioned VAE for lifelong learning (PCLL), to enhance generative replay by incorporating tasks' statistics. PCLL captures task-specific distributions with a conditional variational autoencoder, conditioned on natural language prompts to guide the pseudo-sample generation. Moreover, it leverages a distillation process to further consolidate past knowledge by alleviating the noise in pseudo samples. Experiments on natural language understanding tasks of ToD systems demonstrate that PCLL significantly outperforms competitive baselines in building LL models.

What problem does this paper attempt to address?

### Problems the paper attempts to solve This paper aims to solve the problem of lifelong learning (LL) in task - oriented dialogue systems (ToD), especially how to alleviate catastrophic forgetting. During the continuous learning process, the model needs to keep learning new tasks while retaining the previously learned knowledge. However, the existing generative replay methods usually use a single task - specific token to control the generative model. This method often fails to effectively constrain the generative model due to insufficient information, resulting in low - quality pseudo - samples and thus affecting the performance of the model. To solve these problems, the paper proposes a new method - Prompt Conditioned Variational Auto - Encoder (PCLL), which enhances generative replay by combining task - statistical information. Specifically, PCLL uses Conditional Variational Auto - Encoder (CVAE) to capture task - specific distributions and uses natural - language prompts to guide the generation of pseudo - samples. In addition, PCLL also adopts a knowledge distillation process to further consolidate previous knowledge and reduce the impact of noise in pseudo - samples on model performance. ### Main contributions 1. **Proposed a new method (PCLL)**: Enhanced generative replay for constructing the continuous - learning natural - language - understanding (NLU) module of task - oriented dialogue systems. 2. **Used prompt - conditioned modeling of task distributions**: PCLL uses CVAE to capture the latent distribution specific to the task and uses knowledge distillation to further avoid forgetting. 3. **Extensive experiments and comprehensive analysis**: Verified the superior performance of PCLL and the high - quality pseudo - samples it generates. ### Method overview PCLL includes two main components: - **A language - model - based task solver**: Used to solve NLU tasks. - **A CVAE - based generator**: Used to generate pseudo - samples. When learning a new task, PCLL first uses the trained model to generate pseudo - samples of previous tasks, and then mixes these pseudo - samples with the data of the current task for training. In this way, the model can consolidate the knowledge of previous tasks while learning new tasks. ### Experimental results The experimental results show that PCLL significantly outperforms other baseline methods in intent detection and slot - filling tasks. Specifically: - **Higher average score (Score) and learning - curve area (LCA)**, which reflect the performance of the model throughout the continuous - learning process. - **Better anti - forgetting ability**: When switching to a new task, PCLL can better retain the knowledge of previous tasks and reduce performance degradation. ### Conclusion By combining task - statistical information and knowledge distillation, PCLL effectively enhances generative replay, solves the problem of catastrophic forgetting in continuous learning, and significantly improves the performance of task - oriented dialogue systems.

Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented Dialogue

Continual Learning for Natural Language Generation in Task-oriented Dialog Systems

Deep Reinforcement Learning for Dialogue Generation

CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning

Looking through the past: better knowledge retention for generative replay in continual learning

Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting

Adiabatic replay for continual learning

Adaptive Memory Replay for Continual Learning

Replay-enhanced Continual Reinforcement Learning

Learning latent representations across multiple data domains using Lifelong VAEGAN

Generative Feature Replay with Orthogonal Weight Modification for Continual Learning

APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning

VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought

Prompting to Prompt for Rehearsal-Free Class Incremental Learning.

Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models

Continual Prompt Tuning for Dialog State Tracking

Class-Incremental Learning Using Generative Experience Replay Based on Time-aware Regularization

Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction

SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models