Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models

Haoyu Gao,Ting-En Lin,Hangyu Li,Min Yang,Yuchuan Wu,Wentao Ma,Yongbin Li
2023-09-22
Abstract:Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts. In this study, we propose a novel "Self-Explanation" prompting strategy to enhance the comprehension abilities of LLMs in multi-turn dialogues. This task-agnostic approach requires the model to analyze each dialogue utterance before task execution, thereby improving performance across various dialogue-centric tasks. Experimental results from six benchmark datasets confirm that our method consistently outperforms other zero-shot prompts and matches or exceeds the efficacy of few-shot prompts, demonstrating its potential as a powerful tool in enhancing LLMs' comprehension in complex dialogue tasks.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of large language models (LLMs) lacking the ability to understand complex contexts in multi-turn dialogue tasks. Although LLMs have achieved significant success in various natural language processing (NLP) tasks, they often struggle to comprehend these complex dialogue contexts when performing multi-turn dialogue tasks. To solve this problem, the authors propose a new "Self-Explanation Prompting" strategy, which requires the model to explain each dialogue segment before performing the task, thereby improving its understanding ability in multi-turn dialogues. ### Main Contributions 1. **Comparative Analysis**: The authors comprehensively compare reasoning tasks and dialogue understanding tasks, pointing out the limitations of existing prompting methods in dialogue understanding tasks. 2. **Proposed Method**: A simple and effective prompting strategy—"Self-Explanation"—is proposed, which significantly enhances the understanding ability of large language models in multi-turn dialogues. 3. **Experimental Validation**: Extensive experiments were conducted on 6 dialogue-centric datasets, showing that this method outperforms existing zero-shot and few-shot prompting methods in terms of performance. ### Method Overview - **Formal Definition**: The problem is divided into two parts: context (C) and question (Q). The context provides the background and setting for the question, which is a specific inquiry based on the context. - **Self-Explanation**: Inspired by human cognitive processes, a zero-shot prompting technique is proposed, requiring the model to first explain each segment in the multi-turn dialogue and then complete the task based on the generated explanations. This method helps the model better understand the dialogue flow and given patterns through detailed sentence-by-sentence explanations. ### Experimental Results - **Datasets**: Evaluations were conducted on 6 different types of dialogue understanding datasets, including task-oriented dialogue (TOD), emotion recognition (ERC), and response selection (RS) tasks. - **Performance Improvement**: Experimental results show that the self-explanation prompting method performs excellently on all evaluated datasets compared to existing zero-shot and few-shot prompting methods, with particularly significant performance improvements in TOD tasks. ### Conclusion The paper points out that existing chain-of-thought (CoT) prompting methods perform poorly in multi-turn dialogue tasks because these tasks rely more on context understanding rather than complex reasoning steps. Therefore, the self-explanation prompting strategy proposed by the authors can significantly enhance the understanding ability of large language models in multi-turn dialogues, with broad application prospects.