An Active Inference Strategy for Prompting Reliable Responses from Large Language Models in Medical Practice

Roma Shusterman, Allison C. Waters, Shannon O'Neill, Phan Luu, Don M. Tucker
2024-07-23
Abstract: Continuing advances in Large Language Models (LLMs) in artificial intelligence offer important capacities in intuitively accessing and using medical knowledge in many contexts, including education and training as well as assessment and treatment. Most of the initial literature on LLMs in medicine has emphasized that LLMs are unsuitable for medical use because they are non-deterministic, may provide incorrect or harmful responses, and cannot be regulated to assure quality control. If these issues could be corrected, optimizing LLM technology could benefit patients and physicians by providing affordable, point-of-care medical knowledge. Our proposed framework refines LLM responses by restricting their primary knowledge base to domain-specific datasets containing validated medical information. Additionally, we introduce an actor-critic LLM prompting protocol based on active inference principles of human cognition, where a Therapist agent initially responds to patient queries, and a Supervisor agent evaluates and adjusts responses to ensure accuracy and reliability. We conducted a validation study where expert cognitive behaviour therapy for insomnia (CBT-I) therapists evaluated responses from the LLM in a blind format. Experienced human CBT-I therapists assessed responses to 100 patient queries, comparing LLM-generated responses with appropriate and inappropriate responses crafted by experienced CBT-I therapists. Results showed that LLM responses received high ratings from the CBT-I therapists, often exceeding those of therapist-generated appropriate responses. This structured approach aims to integrate advanced LLM technology into medical applications, meeting regulatory requirements for establishing the safe and effective use of special purpose validated LLMs in medicine.
Computation and Language
What problem does this paper attempt to address?
This paper addresses three obstacles to using large language models (LLMs) in medical practice: their responses are non-deterministic, they may give incorrect or harmful answers, and their output is difficult to subject to quality control. The authors note that LLMs have great potential for accessing and using medical knowledge, but that these risks limit their use in practice. If the problems can be solved, LLM technology could provide affordable, point-of-care medical knowledge to patients and physicians, especially in underserved populations.

To address these problems, the authors propose a framework with two main measures:

1. **Limit the knowledge base**: Restrict the LLM's primary knowledge base to domain-specific datasets containing validated medical information, so that responses are grounded in vetted material.
2. **Introduce an actor-critic LLM prompting protocol based on active inference principles**: A "Therapist" agent first responds to the patient's query, and a "Supervisor" agent then evaluates and adjusts that response to ensure accuracy and reliability.

With these measures, the work aims to integrate LLM technology into medical applications while meeting regulatory requirements for the safe and effective use of special-purpose, validated LLMs in medicine.

To test the approach, the authors conducted a validation study in which experienced cognitive behavioural therapy for insomnia (CBT-I) therapists blindly rated responses to 100 patient queries, comparing LLM-generated responses with appropriate and inappropriate responses written by experienced CBT-I therapists. The LLM responses received high ratings, often exceeding those of the therapist-written appropriate responses. The authors take this as evidence that a structured framework can improve the reliability and accuracy of LLM responses in medical applications.
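The two measures described above (a restricted knowledge base plus a Therapist agent whose draft is reviewed by a Supervisor agent) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `retrieve` and `answer_query`, the prompt wording, the "APPROVE" convention, the word-overlap retriever, and the stub agents are all hypothetical, chosen only to show the control flow. In a real system the two callables would wrap actual LLM calls.

```python
from typing import Callable, List

# Assumption: an "LLM" is any function mapping a prompt string to a reply string.
LLM = Callable[[str], str]


def retrieve(query: str, knowledge_base: List[str], k: int = 2) -> List[str]:
    """Return the k passages sharing the most words with the query.

    A naive stand-in for a real retriever over validated medical content.
    """
    q_words = set(query.lower().split())
    ranked = sorted(
        knowledge_base,
        key=lambda p: len(q_words & set(p.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def answer_query(
    query: str,
    knowledge_base: List[str],
    therapist: LLM,
    supervisor: LLM,
    max_rounds: int = 2,
) -> str:
    """Actor-critic loop: the therapist drafts, the supervisor approves or revises."""
    context = "\n".join(retrieve(query, knowledge_base))
    draft = therapist(f"Context:\n{context}\n\nPatient: {query}\nTherapist:")
    for _ in range(max_rounds):
        verdict = supervisor(
            f"Context:\n{context}\n\nPatient: {query}\nDraft: {draft}\n"
            "Reply APPROVE or give a revised response."
        )
        if verdict.strip().upper().startswith("APPROVE"):
            return draft
        draft = verdict  # hypothetical convention: a non-approval is the revision
    return draft


# Stub agents so the sketch runs without any model; both are illustrative only.
def stub_therapist(prompt: str) -> str:
    return "Try to sleep in on weekends to catch up."


def stub_supervisor(prompt: str) -> str:
    if "sleep in on weekends" in prompt:
        return "Keep a consistent wake time every day, including weekends."
    return "APPROVE"


VALIDATED_PASSAGES = [
    "Maintain a consistent wake time every day.",
    "Limit time in bed to actual sleep time.",
]

final = answer_query(
    "Can I sleep late on weekends?",
    VALIDATED_PASSAGES,
    stub_therapist,
    stub_supervisor,
)
```

In this toy run the Supervisor rejects the first draft, substitutes a revision consistent with the validated passages, and approves it on the next round. Capping the loop with `max_rounds` is a design choice of the sketch, made so that a Supervisor that never approves cannot stall the response.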