Comparing Commercial and Open-Source Large Language Models for Labeling Chest Radiograph Reports

Felix J Dorfner,Liv Jürgensen,Leonhard Donle,Fares Al Mohamad,Tobias R Bodenmann,Mason C Cleveland,Felix Busch,Lisa C Adams,James Sato,Thomas Schultz,Albert E Kim,Jameson Merkow,Keno K Bressem,Christopher P Bridge
DOI: https://doi.org/10.1148/radiol.241139
IF: 19.7
2024-10-30
Radiology
Abstract:Background Rapid advances in large language models (LLMs) have led to the development of numerous commercial and open-source models. While recent publications have explored OpenAI's GPT-4 to extract information of interest from radiology reports, there has not been a real-world comparison of GPT-4 to leading open-source models. Purpose To compare different leading open-source LLMs to GPT-4 on the task of extracting relevant findings from chest radiograph reports. Materials and Methods Two...
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?