A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems

Shiki Sato,Reina Akama,Jun Suzuki,Kentaro Inui
2024-03-19
Abstract:Mitigating the generation of contradictory responses poses a substantial challenge in dialogue response generation. The quality and quantity of available contradictory response data play a vital role in suppressing these contradictions, offering two significant benefits. First, having access to large contradiction data enables a comprehensive examination of their characteristics. Second, data-driven methods to mitigate contradictions may be enhanced with large-scale contradiction data for training. Nevertheless, no attempt has been made to build an extensive collection of model-generated contradictory responses. In this paper, we build a large dataset of response generation models' contradictions for the first time. Then, we acquire valuable insights into the characteristics of model-generated contradictions through an extensive analysis of the collected responses. Lastly, we also demonstrate how this dataset substantially enhances the performance of data-driven contradiction suppression methods.
Computation and Language
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper primarily aims to address the issue of automatically generated contradictory responses in dialogue systems. Specifically: 1. **Data Scarcity Issue**: Currently, there is a lack of large-scale datasets of automatically generated contradictory responses. This scarcity leads to two main problems: - Difficulty in comprehensively understanding the characteristics of contradictory responses generated by automatic models (RGM). - Data-driven methods are limited in their effectiveness in suppressing contradictory responses due to insufficient training data. 2. **Dataset Construction**: To address the above issues, the authors constructed a dataset containing a large number of automatically generated contradictory responses and revealed their characteristics through detailed analysis of these responses. Additionally, they demonstrated how to significantly improve the performance of data-driven contradiction suppression methods using this dataset. 3. **Experimental Validation**: Through comparative experiments, it was proven that detectors trained on automatically generated contradictory responses perform better in identifying contradictions generated by actual models compared to detectors trained on manually written contradictory responses.