Large Language Models for Disease Diagnosis: A Scoping Review

Shuang Zhou,Zidu Xu,Mian Zhang,Chunpu Xu,Yawen Guo,Zaifu Zhan,Sirui Ding,Jiashuo Wang,Kaishuai Xu,Yi Fang,Liqiao Xia,Jeremy Yeung,Daochen Zha,Genevieve B. Melton,Mingquan Lin,Rui Zhang
2024-09-19
Abstract:Automatic disease diagnosis has become increasingly valuable in clinical practice. The advent of large language models (LLMs) has catalyzed a paradigm shift in artificial intelligence, with growing evidence supporting the efficacy of LLMs in diagnostic tasks. Despite the increasing attention in this field, a holistic view is still lacking. Many critical aspects remain unclear, such as the diseases and clinical data to which LLMs have been applied, the LLM techniques employed, and the evaluation methods used. In this article, we perform a comprehensive review of LLM-based methods for disease diagnosis. Our review examines the existing literature across various dimensions, including disease types and associated clinical specialties, clinical data, LLM techniques, and evaluation methods. Additionally, we offer recommendations for applying and evaluating LLMs for diagnostic tasks. Furthermore, we assess the limitations of current research and discuss future directions. To our knowledge, this is the first comprehensive review for LLM-based disease diagnosis.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The main purpose of this paper is to provide a comprehensive review of research on disease diagnosis using large language models (LLMs). Specifically, the paper attempts to answer the following key questions: 1. **Which diseases and clinical data** are used for LLMs-based diagnostic tasks? 2. **Which LLMs technologies** are applied to disease diagnosis? How to choose the appropriate technology? 3. **Which evaluation methods** are suitable for assessing the performance of these models? By systematically reviewing the existing literature, the paper summarizes the types of different diseases, related clinical specialties, the clinical data used, LLMs technologies, and evaluation methods. Additionally, the paper offers recommendations on data preparation, selecting appropriate LLMs technologies, and adopting suitable evaluation strategies. It also points out the limitations in current research and future research directions. Overall, this review paper aims to provide a comprehensive blueprint for disease diagnosis based on LLMs and to offer guidance and inspiration for future related research. To the best of the authors' knowledge, this is the first comprehensive review specifically focused on the application of LLMs in disease diagnosis.