Therapy as an NLP Task: Psychologists' Comparison of LLMs and Human Peers in CBT

Zainab Iftikhar,Sean Ransom,Amy Xiao,Jeff Huang
2024-09-04
Abstract:Wider access to therapeutic care is one of the biggest challenges in mental health treatment. Due to institutional barriers, some people seeking mental health support have turned to large language models (LLMs) for personalized therapy, even though these models are largely unsanctioned and untested. We investigate the potential and limitations of using LLMs as providers of evidence-based therapy by using mixed methods clinical metrics. Using HELPERT, a prompt run on a large language model using the same process and training as a comparative group of peer counselors, we replicated publicly accessible mental health conversations rooted in Cognitive Behavioral Therapy (CBT) to compare session dynamics and counselor's CBT-based behaviors between original peer support sessions and their reconstructed HELPERT sessions. Two licensed, CBT-trained clinical psychologists evaluated the sessions using the Cognitive Therapy Rating Scale and provided qualitative feedback. Our findings show that the peer sessions are characterized by empathy, small talk, therapeutic alliance, and shared experiences but often exhibit therapist drift. Conversely, HELPERT reconstructed sessions exhibit minimal therapist drift and higher adherence to CBT methods but display a lack of collaboration, empathy, and cultural understanding. Through CTRS ratings and psychologists' feedback, we highlight the importance of human-AI collaboration for scalable mental health. Our work outlines the ethical implication of imparting human-like subjective qualities to LLMs in therapeutic settings, particularly the risk of deceptive empathy, which may lead to unrealistic patient expectations and potential harm.
Human-Computer Interaction,Computation and Language
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to evaluate the effectiveness and limitations of large - language models (LLMs) as providers of psychotherapy services based on cognitive - behavioral therapy (CBT). Specifically, through comparing the performance of human peer counselors and an LLM - based system (called HELPERT) in providing a single - session CBT consultation, the research explored the following aspects: 1. **Comparison of CBT consulting capabilities provided by human peer counselors and LLMs**: The research aims to compare the differences in capabilities between human peer counselors and HELPERT in providing CBT - based consulting services through the professional evaluation of clinical psychologists, especially in terms of performance in the therapeutic alliance, cooperation, method - following degree, and impact on participants. 2. **Performance of LLMs in continuous interaction**: Current research on LLMs mostly focuses on users' preferences for single - interaction, ignoring the behavior of these models in continuous interaction. This study fills this gap by using CBT indicators established in the literature and having clinical psychologists evaluate the models. 3. **Ethical and technical challenges**: The research also focuses on the ethical and technical challenges of using LLMs in psychotherapy scenarios, especially the risks that LLMs may bring when simulating human subjective traits (such as empathy), which may lead to patients having unrealistic expectations or potential harm. 4. **Possibility of human - AI collaboration**: Finally, the research explored how to use the respective advantages of humans and AI through their cooperation to provide safer and more effective mental health support, rather than simply replacing one with the other. Through the exploration of these issues, the research hopes to provide new perspectives and solutions for future mental health support models and promote the development of a more equitable and effective mental health care approach.