Abstract:This paper propose to combine pretrained language models with the modular dialogue paradigm for open-domain dialogue modeling. Our method, semantic-enhanced finetuning, instantiates conversation understanding, planning, and response generation as a language model finetuning task. At inference, we disentangle semantic and token variations by specifying sampling methods and constraints for each module separately. For training and evaluation, we present X-Weibo, a Chinese multi-turn open-domain dialogue dataset with automatic annotation for emotions, DAs, and topical words. Experiments show that semantic-enhanced finetuning outperforms strong baselines on non-semantic and semantic metrics, improves the human-evaluated relevance, coherence, and informativeness, and exhibits considerable controllability over semantic variables.

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve The paper aims to combine pre-trained language models with a modular dialogue paradigm to improve the performance of open-domain dialogue systems. Specifically, the paper proposes a method called "Semantic-Enhanced Finetuning," which models dialogue understanding, planning, and response generation as finetuning tasks of a language model. In this way, the paper attempts to address the following issues: 1. **Improving the interpretability and controllability of dialogue systems**: Traditional open-domain dialogue systems often lack explicit modeling of semantic information during the dialogue process, resulting in poor interpretability and controllability. By introducing a modular dialogue paradigm, the paper enables the system to better understand and control semantic variables such as emotions, dialogue acts (DAs), and topics in the dialogue. 2. **Enhancing dialogue quality**: By explicitly modeling semantic variables during the finetuning process, the paper aims to improve the relevance, coherence, and informativeness of the dialogue system. Experimental results show that the Semantic-Enhanced Finetuning method outperforms strong baseline models on both non-semantic and semantic metrics. 3. **Addressing the scalability issue of large-scale dialogue data annotation**: To train and evaluate the model, the paper introduces a Chinese multi-turn open-domain dialogue dataset named X-WEIBO, which contains automatically annotated emotions, dialogue acts, and topic words. By using pre-trained classifiers to automatically annotate semantic variables, the paper addresses the scalability issue of manual annotation for large-scale dialogue data. ### Summary By combining pre-trained language models and a modular dialogue paradigm, the paper proposes a new method—Semantic-Enhanced Finetuning—to improve the performance of open-domain dialogue systems. This method not only enhances the quality of dialogues but also improves the interpretability and controllability of the system. Additionally, the paper addresses the scalability issue of large-scale dialogue data annotation, providing strong support for further research on open-domain dialogue systems.

Semantic-Enhanced Explainable Finetuning for Open-Domain Dialogues

Semantic-based Pre-training for Dialogue Understanding

An Empirical Investigation of Pre-Trained Transformer Language Models for Open-Domain Dialogue Generation

DialogueBERT: A Self-Supervised Learning based Dialogue Pre-training Encoder

Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenario Multi-Domain Dialogue Summarization

End-to-End Trainable Non-Collaborative Dialog System

Human–Machine Multi-Turn Language Dialogue Interaction Based on Deep Learning

SPECTRUM: Speaker-Enhanced Pre-Training for Long Dialogue Summarization

Baichuan2-Sum: Instruction Finetune Baichuan2-7B Model for Dialogue Summarization

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection

DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization

Enhancing Abstractive Dialogue Summarization with Internal Knowledge

Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization

Towards Explainable and Controllable Open Domain Dialogue Generation with Dialogue Acts

Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social Commonsense

Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems

Understanding Chinese Moral Stories with Further Pre-Training

Enhancing the Open-Domain Dialogue Evaluation in Latent Space

DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation