Abstract:With the resurgent interest in building open-domain dialogue systems, the dialogue generation task has attracted increasing attention over the past few years. This task is usually formulated as a conditional generation problem, which aims to generate a natural and meaningful response given dialogue contexts and specific constraints, such as persona. And maintaining a consistent persona is essential for the dialogue systems to gain trust from the users. Although tremendous advancements have been brought, traditional persona-based dialogue models are typically trained by leveraging a large number of persona-dense dialogue examples. Yet, such persona-dense training data are expensive to obtain, leading to a limited scale. This work presents a novel approach to learning from limited training examples by regarding consistency understanding as a regularization of response generation. To this end, we propose a novel stack-propagation framework for learning a generation and understanding <a class="link-external link-http" href="http://pipeline.Specifically" rel="external noopener nofollow">this http URL</a>, the framework stacks a Transformer encoder and two Transformer decoders, where the first decoder models response generation and the second serves as a regularizer and jointly models response generation and consistency understanding. The proposed framework can benefit from the stacked encoder and decoders to learn from much smaller personalized dialogue data while maintaining competitive performance. Under different low-resource settings, subjective and objective evaluations prove that the stack-propagation framework outperforms strong baselines in response quality and persona consistency and largely overcomes the shortcomings of traditional models that rely heavily on the persona-dense dialogue data.

Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks

Data Distillation for Controlling Specificity in Dialogue Generation.

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems

Generating Personalized Dialogue via Multi-Task Meta-Learning

Towards Generalized Models for Task-oriented Dialogue Modeling on Spoken Conversations

Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning

Learning to Know Myself: A Coarse-to-Fine Persona-Aware Training Framework for Personalized Dialogue Generation

Small Changes Make Big Differences: Improving Multi-turn Response Selection in Dialogue Systems Via Fine-Grained Contrastive Learning

Stabilized In-Context Learning with Pre-trained Language Models for Few Shot Dialogue State Tracking

Learning from My Friends: Few-Shot Personalized Conversation Systems via Social Networks

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation

Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach

Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

CTRLStruct: Dialogue Structure Learning for Open-Domain Response Generation

Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation

Different Strokes for Different Folks: Investigating Appropriate Further Pre-training Approaches for Diverse Dialogue Tasks

Diversifying Dialog Generation via Adaptive Label Smoothing

"In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning

A Gaussian Mixture Model for Dialogue Generation with Dynamic Parameter Sharing Strategy.

Personalized Dialogue Response Generation Learned from Monologues