A Survey for Large Language Models in Biomedicine

Chong Wang,Mengyao Li,Junjun He,Zhongruo Wang,Erfan Darzi,Zan Chen,Jin Ye,Tianbin Li,Yanzhou Su,Jing Ke,Kaili Qu,Shuxin Li,Yi Yu,Pietro Liò,Tianyun Wang,Yu Guang Wang,Yiqing Shen
2024-08-29
Abstract:Recent breakthroughs in large language models (LLMs) offer unprecedented natural language understanding and generation capabilities. However, existing surveys on LLMs in biomedicine often focus on specific applications or model architectures, lacking a comprehensive analysis that integrates the latest advancements across various biomedical domains. This review, based on an analysis of 484 publications sourced from databases including PubMed, Web of Science, and arXiv, provides an in-depth examination of the current landscape, applications, challenges, and prospects of LLMs in biomedicine, distinguishing itself by focusing on the practical implications of these models in real-world biomedical contexts. Firstly, we explore the capabilities of LLMs in zero-shot learning across a broad spectrum of biomedical tasks, including diagnostic assistance, drug discovery, and personalized medicine, among others, with insights drawn from 137 key studies. Then, we discuss adaptation strategies of LLMs, including fine-tuning methods for both uni-modal and multi-modal LLMs to enhance their performance in specialized biomedical contexts where zero-shot fails to achieve, such as medical question answering and efficient processing of biomedical literature. Finally, we discuss the challenges that LLMs face in the biomedicine domain including data privacy concerns, limited model interpretability, issues with dataset quality, and ethics due to the sensitive nature of biomedical data, the need for highly reliable model outputs, and the ethical implications of deploying AI in healthcare. To address these challenges, we also identify future research directions of LLM in biomedicine including federated learning methods to preserve data privacy and integrating explainable AI methodologies to enhance the transparency of LLMs.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the following issues: 1. **Comprehensive review of the application of large language models (LLMs) in the biomedical field**: Existing reviews on LLMs in biomedicine often focus on specific applications or model architectures, lacking comprehensive integration and analysis of the latest advancements. This paper provides an in-depth examination of the current status, applications, challenges, and prospects of LLMs in the biomedical field, based on 484 papers retrieved from databases such as PubMed, Web of Science, and arXiv. 2. **Evaluating the performance of LLMs in zero-shot learning**: The study explores the zero-shot learning capabilities of LLMs in various biomedical tasks, including diagnostic assistance, drug discovery, and personalized medicine, citing the results of 137 key studies. 3. **Adaptation strategies and performance enhancement**: It discusses how LLMs can enhance their performance in specific biomedical contexts (such as medical question answering and efficient processing of biomedical literature) through fine-tuning methods, especially in cases where zero-shot learning falls short. 4. **Challenges and solutions**: The paper identifies the main challenges faced by LLMs in the biomedical field, such as data privacy issues, lack of model interpretability, varying quality of datasets, and ethical considerations. It also proposes future research directions, such as federated learning to protect data privacy and combining explainable AI techniques to improve transparency. 5. **Promoting responsible deployment of LLMs in the biomedical field**: It emphasizes the importance of continued research and development to fully harness the potential of LLMs in biomedicine, while ensuring they are applied responsibly and effectively.