Comparative analysis of GPT-4-based ChatGPT's diagnostic performance with radiologists using real-world radiology reports of brain tumors

Yasuhito Mitsuyama,Hiroyuki Tatekawa,Hirotaka Takita,Fumi Sasaki,Akane Tashiro,Satoshi Oue,Shannon L. Walston,Yuta Nonomiya,Ayumi Shintani,Yukio Miki,Daiju Ueda
DOI: https://doi.org/10.1007/s00330-024-11032-8
IF: 7.034
2024-08-31
European Radiology
Abstract:Large language models like GPT-4 have demonstrated potential for diagnosis in radiology. Previous studies investigating this potential primarily utilized quizzes from academic journals. This study aimed to assess the diagnostic capabilities of GPT-4-based Chat Generative Pre-trained Transformer (ChatGPT) using actual clinical radiology reports of brain tumors and compare its performance with that of neuroradiologists and general radiologists.
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?