Abstract:Current state-of-the-art dialogue systems heavily rely on extensive training datasets. However, challenges arise in domains where domain-specific training datasets are insufficient or entirely absent. To tackle this challenge, we propose a novel data \textbf{A}ugmentation framework for \textbf{M}ulti-\textbf{D}omain \textbf{D}ialogue \textbf{G}eneration, referred to as \textbf{AMD$^2$G}. The AMD$^2$G framework consists of a data augmentation process and a two-stage training approach: domain-agnostic training and domain adaptation training. We posit that domain corpora are a blend of domain-agnostic and domain-specific features, with certain representation patterns shared among diverse domains. Domain-agnostic training aims to enable models to learn these common expressive patterns. To construct domain-agnostic dialogue corpora, we employ a \textit{\textbf{de-domaining}} data processing technique used to remove domain-specific features. By mitigating the effects of domain-specific features, the model trained on the de-domained corpora can effectively learn common expression patterns in different domains. Subsequently, we adapt the learned domain-agnostic features to the target domain through domain adaptation training. We conduct experiments on Chinese dialogue datasets from five different domains and show that AMD$^2$G achieves superior performance compared to both direct training on the target domain corpus and collective training on all five domain corpora. Our work underscores AMD$^2$G as a viable alternative solution for low-resource multi-domain dialogue generation. Code and data associated with our work are available on GitHub repository$^{\text 1}$.

A Unified Dialogue User Simulator for Few-shot Data Augmentation

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation

Contextual Data Augmentation for Task-Oriented Dialog Systems

Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation

Learning Towards Selective Data Augmentation for Dialogue Generation.

Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups

Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation

Data Augmentation for Retrieval- and Generation-Based Dialog Systems

Insufficient Data Can Also Rock！Learning to Converse Using Smaller Data with Augmentation

Automatically Learning Data Augmentation Policies for Dialogue Tasks

Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems

LLM–Assisted Data Augmentation for Chinese Dialogue–Level Dependency Parsing

N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking

AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation

Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach

TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems

DFlow: Diverse Dialogue Flow Simulation with Large Language Models

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation