The use of large language models to enhance cancer clinical trial educational materials

Mingye Gao,Aman Varshney,Shan Chen,Vikram Goddla,Jack Gallifant,Patrick Doyle,Claire Novack,Maeve Dillon-Martin,Teresia Perkins,Xinrong Correia,Erik Duhaime,Howard Isenstein,Elad Sharon,Lisa Soleymani Lehmann,David Kozono,Brian Anthony,Dmitriy Dligach,Danielle S. Bitterman
2024-12-03
Abstract:Cancer clinical trials often face challenges in recruitment and engagement due to a lack of participant-facing informational and educational resources. This study investigated the potential of Large Language Models (LLMs), specifically GPT4, in generating patient-friendly educational content from clinical trial informed consent forms. Using data from <a class="link-external link-http" href="http://ClinicalTrials.gov" rel="external noopener nofollow">this http URL</a>, we employed zero-shot learning for creating trial summaries and one-shot learning for developing multiple-choice questions, evaluating their effectiveness through patient surveys and crowdsourced annotation. Results showed that GPT4-generated summaries were both readable and comprehensive, and may improve patients' understanding and interest in clinical trials. The multiple-choice questions demonstrated high accuracy and agreement with crowdsourced annotators. For both resource types, hallucinations were identified that require ongoing human oversight. The findings demonstrate the potential of LLMs "out-of-the-box" to support the generation of clinical trial education materials with minimal trial-specific engineering, but implementation with a human-in-the-loop is still needed to avoid misinformation risks.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?