Cause-Aware Empathetic Response Generation via Chain-of-Thought Fine-Tuning

Xinhao Chen,Chong Yang,Man Lan,Li Cai,Yang Chen,Tu Hu,Xinlin Zhuang,Aimin Zhou
2024-08-21
Abstract:Empathetic response generation endows agents with the capability to comprehend dialogue contexts and react to expressed emotions. Previous works predominantly focus on leveraging the speaker's emotional labels, but ignore the importance of emotion cause reasoning in empathetic response generation, which hinders the model's capacity for further affective understanding and cognitive inference. In this paper, we propose a cause-aware empathetic generation approach by integrating emotions and causes through a well-designed Chain-of-Thought (CoT) prompt on Large Language Models (LLMs). Our approach can greatly promote LLMs' performance of empathy by instruction tuning and enhancing the role awareness of an empathetic listener in the prompt. Additionally, we propose to incorporate cause-oriented external knowledge from COMET into the prompt, which improves the diversity of generation and alleviates conflicts between internal and external knowledge at the same time. Experimental results on the benchmark dataset demonstrate that our approach on LLaMA-7b achieves state-of-the-art performance in both automatic and human evaluations.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the issue of existing methods neglecting emotion cause reasoning when generating empathetic responses. Specifically, current research mainly relies on the speaker's emotion labels to generate empathetic responses but overlooks the importance of understanding the reasons behind the emotions for empathetic comprehension. This leads to limitations in the model's ability for deeper emotional understanding and cognitive reasoning. ### Main Contributions 1. **Proposed a Chain-of-Thought (CoT) fine-tuning method based on emotion causes**: By introducing emotion cause reasoning into large language models (LLMs), the model's empathetic capabilities are enhanced. 2. **Integrated emotion cause-oriented COMET knowledge into CoT prompts**: By combining emotion causes with external commonsense knowledge, the diversity and consistency of generated responses are improved. 3. **Conducted extensive experiments**: Validated the superior performance of the proposed method on benchmark datasets, generating responses that are more empathetic and explanatory. ### Method Overview 1. **Causality-aware CoT prompt construction**: Designed a general CoT generation template to guide the model in reflecting on emotions and their causes, enhancing its role awareness as an empathetic listener. 2. **Emotion cause-oriented COMET knowledge integration**: Generated relevant commonsense knowledge using emotion cause fragments and integrated them into the prompts to improve the consistency and diversity of generated responses. 3. **Instruction fine-tuning**: Enhanced the model's empathetic expression ability through instruction fine-tuning, designing natural language output templates that include emotional reasoning and responses. 4. **Demonstration and loss function**: Utilized a few examples for In-Context Learning (ICL) and optimized model performance through supervised fine-tuning. ### Experimental Results - **Automatic Evaluation**: The CFEG method significantly outperformed baseline models on multiple metrics, especially in terms of emotion accuracy, BLEU score, and response diversity. - **Human Evaluation**: The CFEG method also excelled in empathy, informativeness, and coherence, although it was slightly inferior to ChatGPT in fluency. - **Ablation Study**: Verified the effectiveness of each module, particularly the importance of causal reasoning and listener-aware reasoning in empathetic generation. ### Conclusion By introducing emotion cause reasoning and instruction fine-tuning, this paper significantly enhances the performance of large language models in generating empathetic responses. Experimental results show that the proposed method performs excellently in both automatic and human evaluations, generating responses that are more empathetic and explanatory.