SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations

Yirong Chen,Xiaofen Xing,Jingkai Lin,Huimin Zheng,Zhenyu Wang,Qi Liu,Xiangmin Xu
2023-11-01
Abstract:Large language models (LLMs) have been widely applied in various fields due to their excellent capability for memorizing knowledge and chain of thought (CoT). When these language models are applied in the field of psychological counseling, they often rush to provide universal advice. However, when users seek psychological support, they need to gain empathy, trust, understanding and comfort, rather than just reasonable advice. To this end, we constructed a multi-turn empathetic conversation dataset of more than 2 million samples, in which the input is the multi-turn conversation context, and the target is empathetic responses that cover expressions such as questioning, comfort, recognition, listening, trust, emotional support, etc. Experiments have shown that the empathy ability of LLMs can be significantly enhanced when finetuning by using multi-turn dialogue history and responses that are closer to the expression of a psychological consultant.
Computation and Language
What problem does this paper attempt to address?
The paper attempts to address three major issues exhibited by large language models (LLMs) in mental health or emotional support conversations: 1. **Repetitive and Standardized Responses**: Existing large language models (such as ChatGPT) tend to use fixed templates when dealing with mental health-related issues, for example: "I'm sorry... {xxx} is... Here are some suggestions:...". This standardized response may make users feel bored. 2. **Too Much Advice Rather Than Listening**: These models tend to provide specific advice rather than understanding the user's needs through questioning or listening. Professional psychological counselors rarely provide specific advice directly during consultations; instead, they guide users more through listening and questioning. 3. **Lack of Empathy**: For users who need listening and comfort, existing large language models often behave like a rational "straight man," lacking the necessary empathy and support. To address these issues, the paper constructs a large-scale multi-turn empathetic dialogue dataset (SoulChatCorpus) and fine-tunes these datasets to improve the empathy, listening, and comforting abilities of large language models in mental health and emotional support conversations. Specifically, the dataset contains over 2 million samples, each of which is a multi-turn conversation covering various ways of expressing empathy, such as questioning, comforting, acknowledging, listening, trusting, and emotional support. Experimental results show that fine-tuning using this dataset can significantly enhance the performance of large language models in these aspects.