Semantic-Enhanced Explainable Finetuning for Open-Domain Dialogues

Yinhe Zheng,Yida Wang,Pei Ke,Zhenyu Yang,Minlie Huang
DOI: https://doi.org/10.48550/arXiv.2106.03065
2022-05-24
Abstract:This paper propose to combine pretrained language models with the modular dialogue paradigm for open-domain dialogue modeling. Our method, semantic-enhanced finetuning, instantiates conversation understanding, planning, and response generation as a language model finetuning task. At inference, we disentangle semantic and token variations by specifying sampling methods and constraints for each module separately. For training and evaluation, we present X-Weibo, a Chinese multi-turn open-domain dialogue dataset with automatic annotation for emotions, DAs, and topical words. Experiments show that semantic-enhanced finetuning outperforms strong baselines on non-semantic and semantic metrics, improves the human-evaluated relevance, coherence, and informativeness, and exhibits considerable controllability over semantic variables.
Computation and Language
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The paper aims to combine pre-trained language models with a modular dialogue paradigm to improve the performance of open-domain dialogue systems. Specifically, the paper proposes a method called "Semantic-Enhanced Finetuning," which models dialogue understanding, planning, and response generation as finetuning tasks of a language model. In this way, the paper attempts to address the following issues: 1. **Improving the interpretability and controllability of dialogue systems**: Traditional open-domain dialogue systems often lack explicit modeling of semantic information during the dialogue process, resulting in poor interpretability and controllability. By introducing a modular dialogue paradigm, the paper enables the system to better understand and control semantic variables such as emotions, dialogue acts (DAs), and topics in the dialogue. 2. **Enhancing dialogue quality**: By explicitly modeling semantic variables during the finetuning process, the paper aims to improve the relevance, coherence, and informativeness of the dialogue system. Experimental results show that the Semantic-Enhanced Finetuning method outperforms strong baseline models on both non-semantic and semantic metrics. 3. **Addressing the scalability issue of large-scale dialogue data annotation**: To train and evaluate the model, the paper introduces a Chinese multi-turn open-domain dialogue dataset named X-WEIBO, which contains automatically annotated emotions, dialogue acts, and topic words. By using pre-trained classifiers to automatically annotate semantic variables, the paper addresses the scalability issue of manual annotation for large-scale dialogue data. ### Summary By combining pre-trained language models and a modular dialogue paradigm, the paper proposes a new method—Semantic-Enhanced Finetuning—to improve the performance of open-domain dialogue systems. This method not only enhances the quality of dialogues but also improves the interpretability and controllability of the system. Additionally, the paper addresses the scalability issue of large-scale dialogue data annotation, providing strong support for further research on open-domain dialogue systems.