ReflectDiffu:Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework

Jiahao Yuan,Zixiang Di,Zhiqing Cui,Guisong Yang,Usman Naseem
2024-09-19
Abstract:Empathetic response generation necessitates the integration of emotional and intentional dynamics to foster meaningful interactions. Existing research either neglects the intricate interplay between emotion and intent, leading to suboptimal controllability of empathy, or resorts to large language models (LLMs), which incur significant computational overhead. In this paper, we introduce ReflectDiffu, a lightweight and comprehensive framework for empathetic response generation. This framework incorporates emotion contagion to augment emotional expressiveness and employs an emotion-reasoning mask to pinpoint critical emotional elements. Additionally, it integrates intent mimicry within reinforcement learning for refinement during diffusion. By harnessing an intent twice reflect the mechanism of Exploring-Sampling-Correcting, ReflectDiffu adeptly translates emotional decision-making into precise intent actions, thereby addressing empathetic response misalignments stemming from emotional misrecognition. Through reflection, the framework maps emotional states to intents, markedly enhancing both response empathy and flexibility. Comprehensive experiments reveal that ReflectDiffu outperforms existing models regarding relevance, controllability, and informativeness, achieving state-of-the-art results in both automatic and human evaluations.
Artificial Intelligence,Computation and Language,Machine Learning
What problem does this paper attempt to address?
### The Problem Addressed by the Paper This paper aims to address several key issues in lightweight empathetic response generation models: 1. **Interaction Mechanism between Emotion and Intent**: Existing research often overlooks the complex interplay between emotions and intents, leading to poor controllability of empathy. 2. **Computational Overhead**: Methods relying on large-scale language models (LLMs) are effective but come with high computational costs. 3. **Depth and Flexibility of Emotional Understanding**: Lightweight models mainly depend on external knowledge signals rather than underlying psychological mechanisms, which limits their empathetic capability and flexibility. 4. **Lack of Multi-task Datasets**: Existing lightweight models lack unified multi-task dataset support for emotional reasoning masking, intent prediction, and empathetic dialogue generation. To address these issues, the paper proposes a framework named **ReflectDiffu**, which combines emotional contagion and intent imitation mechanisms and refines intents during the diffusion process through reinforcement learning. Specifically, ReflectDiffu improves empathetic response generation in the following ways: - Introducing an Emotional Reasoning Annotator (ERA) to enhance emotional understanding. - Proposing an Intent Twice mechanism, i.e., Exploring-Sampling-Correcting mechanism, to improve the consistency between emotions and intents. - Utilizing large-scale language models to extend annotations on the EmpatheticDialogue dataset and create a multi-task dataset. Experimental results show that ReflectDiffu outperforms existing empathetic dialogue generation models in both automatic and human evaluations, excelling in relevance, controllability, and informativeness.