Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles

Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang
2024-07-15
Abstract: Recent works leverage LLMs to roleplay realistic social scenarios, aiding novices in practicing their social skills. However, simulating sensitive interactions, such as in mental health, is challenging. Privacy concerns restrict data access, and collecting expert feedback, although vital, is laborious. To address this, we develop Roleplay-doh, a novel human-LLM collaboration pipeline that elicits qualitative feedback from a domain-expert, which is transformed into a set of principles, or natural language rules, that govern an LLM-prompted roleplay. We apply this pipeline to enable senior mental health supporters to create customized AI patients for simulated practice partners for novice counselors. After uncovering issues in GPT-4 simulations not adhering to expert-defined principles, we also introduce a novel principle-adherence prompting pipeline which shows 30% improvements in response quality and principle following for the downstream task. Via a user study with 25 counseling experts, we demonstrate that the pipeline makes it easy and effective to create AI patients that more faithfully resemble real patients, as judged by creators and third-party counselors. See our project website at https://roleplay-doh.github.io/ for code and data.
Computation and Language, Human-Computer Interaction
What problem does this paper attempt to address?
This paper asks how large language models (LLMs) can be used to create realistic simulated patients in sensitive fields such as mental health, so that novice counselors can practice with them. It focuses on three obstacles:

1. **Privacy issues**: Privacy protections in mental health make it difficult to obtain real patient data for training or fine-tuning LLMs.
2. **Collection of expert feedback**: Expert feedback is crucial for making LLM simulations more authentic, but traditional collection methods are time-consuming and labor-intensive.
3. **Simulation quality**: Existing LLM simulations struggle to generate high-quality responses that conform to expert-defined principles, especially for principles with multiple parts or principles that cover multiple situations.

To address these problems, the paper proposes a tool named Roleplay-doh, which works in two ways:

- **Principle extraction**: Domain experts define the behavior of a simulated patient by giving natural-language feedback. This feedback is converted into specific natural-language rules (principles) that guide the LLM's simulation behavior.
- **Principle-following pipeline**: A new pipeline decomposes complex multi-part principles into simple yes/no questions and checks whether each principle applies at the current point in the conversation, improving the accuracy and consistency of the simulation.

With these methods, Roleplay-doh aims to let domain experts create and customize realistic AI patients more easily, giving novice counselors more effective training resources.
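The principle-following idea described above (split a multi-part principle into yes/no questions, and skip principles that do not apply to the current turn) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `Principle`, `Check`, `decompose`, `evaluate`, and `toy_judge` are hypothetical, the string split stands in for what a real system would presumably do with an LLM prompt, and `toy_judge` is a keyword heuristic used only so the example runs without a model.

```python
import re
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical stand-in for an LLM call: takes a yes/no question and the text
# under review, returns True for "yes". Not an API from the paper.
Judge = Callable[[str, str], bool]


@dataclass
class Principle:
    rule: str            # what the simulated patient should do
    condition: str = ""  # situation in which the rule applies; empty = always


@dataclass
class Check:
    question: str  # yes/no question derived from one part of a principle
    passed: bool   # the judge's answer for the candidate response


def decompose(rule: str) -> List[str]:
    """Split a multi-part rule into one yes/no question per part.

    Assumption: parts are separated by ';' or ' and '. A real system would
    presumably use an LLM for this step; a split keeps the sketch runnable.
    """
    parts = re.split(r";| and ", rule)
    return [f"Does the response {p.strip().rstrip('.')}?" for p in parts if p.strip()]


def evaluate(principles: List[Principle], counselor_msg: str,
             response: str, judge: Judge) -> List[Check]:
    """Answer every yes/no question for the principles that apply to this turn."""
    results = []
    for p in principles:
        # Applicability check: skip principles whose condition is not met.
        if p.condition and not judge(f"Is it true that {p.condition}?", counselor_msg):
            continue
        for question in decompose(p.rule):
            results.append(Check(question, judge(question, response)))
    return results


def toy_judge(question: str, text: str) -> bool:
    """Keyword heuristic used only for demonstration (assumption, not a model)."""
    q, t = question.lower(), text.lower()
    if "asks about feelings" in q:
        return "feel" in t
    if "asks about work" in q:
        return "work" in t
    if "keep replies short" in q:
        return len(text.split()) <= 15
    if "avoid clinical terms" in q:
        return "diagnosis" not in t
    return True


principles = [
    Principle("keep replies short and avoid clinical terms",
              "the counselor asks about feelings"),
    Principle("mention work stress", "the counselor asks about work"),
]
checks = evaluate(principles, "How do you feel today?",
                  "I guess my diagnosis says it all.", toy_judge)
for c in checks:
    print(c.question, "->", "yes" if c.passed else "no")
```

In this toy run the second principle is skipped because the counselor's message is not about work, and the first principle yields two separate questions, one of which fails. A failed question could then serve as a signal to revise the response, though how the paper's pipeline uses these answers is not specified in the summary above.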