Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented Dialogue

Yingxiu Zhao,Yinhe Zheng,Zhiliang Tian,Chang Gao,Bowen Yu,Haiyang Yu,Yongbin Li,Jian Sun,Nevin L. Zhang
DOI: https://doi.org/10.48550/arXiv.2210.07783
2022-11-24
Abstract:Lifelong learning (LL) is vital for advanced task-oriented dialogue (ToD) systems. To address the catastrophic forgetting issue of LL, generative replay methods are widely employed to consolidate past knowledge with generated pseudo samples. However, most existing generative replay methods use only a single task-specific token to control their models. This scheme is usually not strong enough to constrain the generative model due to insufficient information involved. In this paper, we propose a novel method, prompt conditioned VAE for lifelong learning (PCLL), to enhance generative replay by incorporating tasks' statistics. PCLL captures task-specific distributions with a conditional variational autoencoder, conditioned on natural language prompts to guide the pseudo-sample generation. Moreover, it leverages a distillation process to further consolidate past knowledge by alleviating the noise in pseudo samples. Experiments on natural language understanding tasks of ToD systems demonstrate that PCLL significantly outperforms competitive baselines in building LL models.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to solve the problem of lifelong learning (LL) in task - oriented dialogue systems (ToD), especially how to alleviate catastrophic forgetting. During the continuous learning process, the model needs to keep learning new tasks while retaining the previously learned knowledge. However, the existing generative replay methods usually use a single task - specific token to control the generative model. This method often fails to effectively constrain the generative model due to insufficient information, resulting in low - quality pseudo - samples and thus affecting the performance of the model. To solve these problems, the paper proposes a new method - Prompt Conditioned Variational Auto - Encoder (PCLL), which enhances generative replay by combining task - statistical information. Specifically, PCLL uses Conditional Variational Auto - Encoder (CVAE) to capture task - specific distributions and uses natural - language prompts to guide the generation of pseudo - samples. In addition, PCLL also adopts a knowledge distillation process to further consolidate previous knowledge and reduce the impact of noise in pseudo - samples on model performance. ### Main contributions 1. **Proposed a new method (PCLL)**: Enhanced generative replay for constructing the continuous - learning natural - language - understanding (NLU) module of task - oriented dialogue systems. 2. **Used prompt - conditioned modeling of task distributions**: PCLL uses CVAE to capture the latent distribution specific to the task and uses knowledge distillation to further avoid forgetting. 3. **Extensive experiments and comprehensive analysis**: Verified the superior performance of PCLL and the high - quality pseudo - samples it generates. ### Method overview PCLL includes two main components: - **A language - model - based task solver**: Used to solve NLU tasks. - **A CVAE - based generator**: Used to generate pseudo - samples. When learning a new task, PCLL first uses the trained model to generate pseudo - samples of previous tasks, and then mixes these pseudo - samples with the data of the current task for training. In this way, the model can consolidate the knowledge of previous tasks while learning new tasks. ### Experimental results The experimental results show that PCLL significantly outperforms other baseline methods in intent detection and slot - filling tasks. Specifically: - **Higher average score (Score) and learning - curve area (LCA)**, which reflect the performance of the model throughout the continuous - learning process. - **Better anti - forgetting ability**: When switching to a new task, PCLL can better retain the knowledge of previous tasks and reduce performance degradation. ### Conclusion By combining task - statistical information and knowledge distillation, PCLL effectively enhances generative replay, solves the problem of catastrophic forgetting in continuous learning, and significantly improves the performance of task - oriented dialogue systems.