Comparison of the problem-solving performance of ChatGPT-3.5, ChatGPT-4, Bing Chat, and Bard for the Korean emergency medicine board examination question bank

Go Un Lee,Dae Young Hong,Sin Young Kim,Jong Won Kim,Young Hwan Lee,Sang O Park,Kyeong Ryong Lee
DOI: https://doi.org/10.1097/md.0000000000037325
IF: 1.6
2024-03-13
Medicine
Abstract:Presently, the possibility of using artificial intelligence (AI) in various fields is in-creasing, and innovative changes in this technology are occurring at such a rapid pace that it is being called the era of AI. In the field of medicine, AI shows great potential in early disease diagnosis, individualized patient management, complex data analysis, medical drug production, and medical education. [ 1 ] In particular, large language models (LLMs), such as ChatGPT (OpenAI, San Francisco, CA), Bing Chat (Microsoft, Redmond, WA), and Bard (Google, Mountain View, CA), are attracting attention for their performance and usability because they show considerable potential in providing medical field information and advice and in interacting with patients. [ 2–4 ]
medicine, general & internal
What problem does this paper attempt to address?