Application, investigation and prediction of ChatGpt/GPT-4 for clinical cases in medical field

Xingyu Zhao
DOI: https://doi.org/10.54254/2755-2721/54/20241225
2024-03-29
Abstract:The integration of Artificial Intelligence (AI) and medical treatment not only makes the clinical diagnosis more accurate, but also makes the patient's rehabilitation more systematic and professional, especially after the advent of the Large Language Model(LLM) in the past 2 years. This paper discusses 3 clinical cases of 2 kinds of LLMs: ChatGPT (GPT-3.5) and GPT-4 in Physical Medicine and Rehabilitation (PM&R), and shows their powerful analytical reasoning ability. In the first experiment, ChatGPT and the leading professional doctors in the industry were asked to classify the emergency records of ophthalmology during the 10-year period, infer the severity of each patient's illness and determine the nursing requirements. In the later experiment, GPT-4, an upgraded version of GPT-3.5, delayed the diagnosis of medical history data of patients aged 65 and over, to study the clinical diagnosis opinions and systematic treatment scheme of GPT-4 as a "professional doctor". ChatGPT and GPT-4 participated in the examination with 12 categories of neurosurgery medical fields, which was shown in the last experiment, aiming at studying their medical professional level and discussing their clinical reliability and effectiveness, as well as LLMs' ability of reasoning questions step by step. The experimental results show that these 2 kinds of large language models have professional and powerful ability to analyze actual projects, and their performance even far exceeds that of professional clinicians. At the same time, the existing defects of the models and their more applications in the medical field in the future are prospected.
What problem does this paper attempt to address?