Contextual Data Augmentation for Task-Oriented Dialog Systems

Dustin Axman,Avik Ray,Shubham Garg,Jing Huang

2023-10-16

Abstract:Collection of annotated dialogs for training task-oriented dialog systems have been one of the key bottlenecks in improving current models. While dialog response generation has been widely studied on the agent side, it is not evident if similar generative models can be used to generate a large variety of, and often unexpected, user inputs that real dialog systems encounter in practice. Existing data augmentation techniques such as paraphrase generation do not take the dialog context into consideration. In this paper, we develop a novel dialog augmentation model that generates a user turn, conditioning on full dialog context. Additionally, with a new prompt design for language model, and output re-ranking, the dialogs generated from our model can be directly used to train downstream dialog systems. On common benchmark datasets MultiWoZ and SGD, we show that our dialog augmentation model generates high quality dialogs and improves dialog success rate by as much as $8\%$ over baseline.

Computation and Language

What problem does this paper attempt to address?

The paper aims to address the issue of insufficient training data in task-oriented dialogue systems. Specifically, it proposes a novel dialogue augmentation model to generate variations of user inputs. These variations not only consider the context of the current dialogue but also incorporate information from future dialogues. Existing data augmentation techniques (such as paraphrase generation) typically do not take dialogue context into account, thus having limited coverage in practical applications. Additionally, the paper introduces a new language model prompt design combined with output reordering techniques, enabling the generated dialogues to be directly used for training downstream dialogue systems. Experiments on standard benchmark datasets MultiWoZ and SGD demonstrate that the dialogues generated by this model are of high quality and significantly improve task completion rates and goal accuracy, with up to an 8% increase in success rate compared to baseline models. This proves that the method can effectively enhance existing dialogue datasets and improve the performance of task-oriented dialogue systems.

Contextual Data Augmentation for Task-Oriented Dialog Systems

Paraphrase Augmented Task-Oriented Dialog Generation

Learning Towards Selective Data Augmentation for Dialogue Generation.

Data Augmentation for Retrieval- and Generation-Based Dialog Systems

Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues

A Unified Dialogue User Simulator for Few-shot Data Augmentation

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System

Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems

Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation

Automatically Learning Data Augmentation Policies for Dialogue Tasks

Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting

Dialog State Tracking with Reinforced Data Augmentation

DialAug: Mixing up Dialogue Contexts in Contrastive Learning for Robust Conversational Modeling

Insufficient Data Can Also Rock！Learning to Converse Using Smaller Data with Augmentation

Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups

N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking

A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation

TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations

Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data Augmentation

Saliency infused dialogue response generation: Improving task oriented text generation using feature attribution