A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems

Shiki Sato, Reina Akama, Jun Suzuki, Kentaro Inui

2024-03-19

Abstract: Mitigating the generation of contradictory responses poses a substantial challenge in dialogue response generation. The quality and quantity of available contradictory response data play a vital role in suppressing these contradictions, offering two significant benefits. First, having access to large contradiction data enables a comprehensive examination of their characteristics. Second, data-driven methods to mitigate contradictions may be enhanced with large-scale contradiction data for training. Nevertheless, no attempt has been made to build an extensive collection of model-generated contradictory responses. In this paper, we build a large dataset of response generation models' contradictions for the first time. Then, we acquire valuable insights into the characteristics of model-generated contradictions through an extensive analysis of the collected responses. Lastly, we also demonstrate how this dataset substantially enhances the performance of data-driven contradiction suppression methods.

Computation and Language

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve

This paper addresses contradictory responses produced by response generation models (RGMs) in dialogue systems. Specifically:

1. **Data Scarcity Issue**: No large-scale dataset of model-generated contradictory responses currently exists. This scarcity causes two main problems:
   - It is difficult to comprehensively characterize the contradictory responses that RGMs produce.
   - Data-driven methods for suppressing contradictory responses are limited by insufficient training data.
2. **Dataset Construction**: To address these issues, the authors build a large dataset of model-generated contradictory responses and analyze the collected responses in detail to reveal their characteristics. They also show that the dataset substantially improves the performance of data-driven contradiction suppression methods.
3. **Experimental Validation**: Comparative experiments show that detectors trained on model-generated contradictory responses identify contradictions produced by actual models better than detectors trained on human-written contradictory responses.

Red Teaming Language Models for Processing Contradictory Dialogues

Generating Prototypes for Contradiction Detection Using Large Language Models and Linguistic Rules

N-best Response-based Analysis of Contradiction-awareness in Neural Response Generation Models

I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

CDConv: A Benchmark for Contradiction Detection in Chinese Conversations

Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions

Inconsistent dialogue responses and how to recover from them

EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset

Generate, Evaluate, and Select: A Dialogue System with a Response Evaluator for Diversity-Aware Response Generation

ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions

SparseCL: Sparse Contrastive Learning for Contradiction Retrieval

Generating Diverse Negations from Affirmative Sentences

Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation

ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI

Don't Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation

Improving Dialogue Management: Quality Datasets vs Models

Constructing Emotion Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation

Detecting Response Generation Not Requiring Factual Judgment