Role of visual information in multimodal large language model performance: an evaluation using the Japanese nuclear medicine board examination

Takashi Watanabe, Akira Baba, Takeshi Fukuda, Ken Watanabe, Jun Woo, Hiroya Ojiri
DOI: https://doi.org/10.1007/s12149-024-01992-8
2024-11-20
Annals of Nuclear Medicine
Abstract: This study aimed to assess the performance of state-of-the-art multimodal large language models (LLMs), specifically GPT-4o, Claude 3 Opus, and Gemini 1.5 Pro, on Japanese Nuclear Medicine Board Examination (JNMBE) questions and to evaluate the influence of visual information on the decision-making process.
radiology, nuclear medicine & medical imaging