Towards Automated Psychotherapy via Language Modeling

Houjun Liu
DOI: https://doi.org/10.48550/arXiv.2104.10661
2021-04-05
Abstract:In this experiment, a model was devised, trained, and evaluated to automate psychotherapist/client text conversations through the use of state-of-the-art, Seq2Seq Transformer-based Natural Language Generation (NLG) systems. Through training the model upon a mix of the Cornell Movie Dialogue Corpus for language understanding and an open-source, anonymized, and public licensed psychotherapeutic dataset, the model achieved statistically significant performance in published, standardized qualitative benchmarks against human-written validation data - meeting or exceeding human-written responses' performance in 59.7% and 67.1% of the test set for two independent test methods respectively. Although the model cannot replace the work of psychotherapists entirely, its ability to synthesize human-appearing utterances for the majority of the test set serves as a promising step towards communizing and easing stigma at the psychotherapeutic point-of-care.
Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to automate the dialogue process in psychotherapy by using advanced natural language generation (NLG) techniques, especially Transformer - based sequence - to - sequence models. Specifically, the author aims to develop a model that can automatically generate text dialogues similar to those between human psychotherapists and clients, in order to reduce the workload of professional psychotherapists and decrease people's negative views on psychotherapy, so that more people can have easier access to psychotherapy services. To achieve this goal, the author proposes a method. That is, first train a basic language understanding model on a large movie dialogue corpus, and then gradually introduce a smaller psychotherapy dataset for fine - tuning, so that the model can better adapt to the psychotherapy scenario. This method not only helps to alleviate the problem of insufficient data, but also can improve the performance of the model on specific tasks while maintaining its generalization ability. In this way, the author hopes that the model can generate high - quality dialogue responses at a human - like level, and thus replace or assist human psychotherapists' work to a certain extent.