Contextual Data Augmentation for Task-Oriented Dialog Systems

Dustin Axman, Avik Ray, Shubham Garg, Jing Huang
2023-10-16
Abstract: Collection of annotated dialogs for training task-oriented dialog systems has been one of the key bottlenecks in improving current models. While dialog response generation has been widely studied on the agent side, it is not evident whether similar generative models can be used to generate the large variety of often unexpected user inputs that real dialog systems encounter in practice. Existing data augmentation techniques such as paraphrase generation do not take the dialog context into consideration. In this paper, we develop a novel dialog augmentation model that generates a user turn, conditioning on the full dialog context. Additionally, with a new prompt design for the language model and output re-ranking, the dialogs generated by our model can be used directly to train downstream dialog systems. On the common benchmark datasets MultiWoZ and SGD, we show that our dialog augmentation model generates high-quality dialogs and improves dialog success rate by as much as $8\%$ over the baseline.
Computation and Language
What problem does this paper attempt to address?
The paper addresses the shortage of annotated training data for task-oriented dialogue systems. It proposes a dialogue augmentation model that generates variations of a user turn while conditioning on the full dialogue context, meaning the turns that follow the user turn as well as those that precede it. Existing data augmentation techniques, such as paraphrase generation, typically ignore dialogue context, which limits the range of user inputs they can cover. The paper also introduces a new prompt design for the language model and combines it with output re-ranking, so that the generated dialogues can be used directly to train downstream dialogue systems. Experiments on the standard benchmark datasets MultiWoZ and SGD indicate that the generated dialogues are of high quality and improve task completion rates and goal accuracy, with up to an 8% increase in success rate over the baseline. These results suggest that the method can effectively augment existing dialogue datasets and improve the performance of task-oriented dialogue systems.
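The two steps described above, prompting with the full dialogue context and re-ranking the outputs, can be illustrated with a minimal Python sketch. It is not the paper's prompt design or re-ranker, neither of which is specified here. The placeholder token `[USER_TURN]`, the helpers `build_prompt`, `rerank`, and `slot_coverage`, and the slot-coverage scoring rule are all hypothetical, chosen only to show the general shape of the idea. Candidate user turns are assumed to come from some language model that is not shown.

```python
from typing import Callable, Iterable, List, Sequence, Tuple

Turn = Tuple[str, str]  # (speaker, utterance); speaker is "user" or "agent"


def build_prompt(dialog: Sequence[Turn], target_index: int,
                 mask_token: str = "[USER_TURN]") -> str:
    """Render every turn of the dialog, replacing the target user turn
    with a placeholder so a model sees both earlier and later turns.
    The layout and mask token are assumptions, not the paper's format."""
    if dialog[target_index][0] != "user":
        raise ValueError("target turn must be a user turn")
    lines = []
    for i, (speaker, text) in enumerate(dialog):
        shown = mask_token if i == target_index else text
        lines.append(f"{speaker}: {shown}")
    return "\n".join(lines)


def slot_coverage(slot_values: Sequence[str]) -> Callable[[str], float]:
    """Toy scorer (an assumption): fraction of required slot values that
    appear in a candidate, so existing annotations stay valid."""
    def score(candidate: str) -> float:
        if not slot_values:
            return 0.0
        text = candidate.lower()
        hits = sum(value.lower() in text for value in slot_values)
        return hits / len(slot_values)
    return score


def rerank(candidates: Iterable[str],
           score: Callable[[str], float]) -> List[str]:
    """Drop empty and duplicate candidates, then sort best-first."""
    stripped = (c.strip() for c in candidates)
    unique = list(dict.fromkeys(c for c in stripped if c))
    return sorted(unique, key=score, reverse=True)


if __name__ == "__main__":
    dialog = [
        ("user", "I need a cheap hotel in the north."),
        ("agent", "How many nights will you stay?"),
        ("user", "Three nights please."),
        ("agent", "Booked for three nights."),
    ]
    print(build_prompt(dialog, target_index=2))
    # Candidates would normally be sampled from a language model.
    candidates = ["Just one night.", "I'd like three nights.", ""]
    print(rerank(candidates, slot_coverage(["three"])))
```

Masking the target turn rather than truncating the dialog at that point is what lets later agent turns constrain the generated user input. A real re-ranker would likely use a learned or model-based score instead of substring matching.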