Data Extraction from Free-Text Reports on Mechanical Thrombectomy in Acute Ischemic Stroke Using ChatGPT: A Retrospective Analysis

Nils C Lehnen,Franziska Dorn,Isabella C Wiest,Hanna Zimmermann,Alexander Radbruch,Jakob Nikolas Kather,Daniel Paech,Nils C. Lehnen,Isabella C. Wiest,Ariane Panzer
DOI: https://doi.org/10.1148/radiol.232741
IF: 19.7
2024-04-18
Radiology
Abstract:Background Procedural details of mechanical thrombectomy in patients with ischemic stroke are important predictors of clinical outcome and are collected for prospective studies or national stroke registries. To date, these data are collected manually by human readers, a labor-intensive task that is prone to errors. Purpose To evaluate the use of the large language models (LLMs) GPT-4 and GPT-3.5 to extract data from neuroradiology reports on mechanical thrombectomy in patients with ischemic...
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to automatically extract standard procedure data, clinical data, and data on materials or drugs used from free - text neuroradiology reports during mechanical thrombectomy in patients with acute ischemic stroke. Currently, these data are usually manually extracted by humans, which is a labor - intensive task and error - prone. By evaluating the performance of large - language models (such as GPT - 4 and GPT - 3.5) in this task, the paper explores whether these models can be an effective alternative to manual data extraction, thereby improving the efficiency and accuracy of data processing. Specifically, the main objectives of the study include: 1. **Evaluating the performance of GPT - 4 and GPT - 3.5**: Comparing the accuracy of these two models in extracting data from mechanical thrombectomy reports. 2. **Verifying the generalization ability of the models**: Verifying the performance of the models on different datasets by using external reports from different institutions. 3. **Optimizing the prompts**: Improving the extraction accuracy of the models by optimizing the prompts, especially in cases where the performance on certain data points is poor. Through these objectives, the paper aims to explore the application potential of large - language models in medical data processing, especially for tasks that require a large amount of manual labor.