Multi-modal large language models in radiology: principles, applications, and potential

Yiqiu Shen,Yanqi Xu,Jiajian Ma,Wushuang Rui,Chen Zhao,Laura Heacock,Chenchan Huang
DOI: https://doi.org/10.1007/s00261-024-04708-8
IF: 2.4
2024-12-04
Abdominal Radiology
Abstract:Large language models (LLMs) and multi-modal large language models (MLLMs) represent the cutting-edge in artificial intelligence. This review provides a comprehensive overview of their capabilities and potential impact on radiology. Unlike most existing literature reviews focusing solely on LLMs, this work examines both LLMs and MLLMs, highlighting their potential to support radiology workflows such as report generation, image interpretation, EHR summarization, differential diagnosis generation, and patient education. By streamlining these tasks, LLMs and MLLMs could reduce radiologist workload, improve diagnostic accuracy, support interdisciplinary collaboration, and ultimately enhance patient care. We also discuss key limitations, such as the limited capacity of current MLLMs to interpret 3D medical images and to integrate information from both image and text data, as well as the lack of effective evaluation methods. Ongoing efforts to address these challenges are introduced.
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?