Evaluating the accuracy and reliability of AI chatbots in disseminating the content of current resuscitation guidelines: a comparative analysis between the ERC 2021 guidelines and both ChatGPTs 3.5 and 4

Stefanie Beck, Manuel Kuhner, Markus Haar, Anne Daubmann, Martin Semmann, Stefan Kluge
DOI: https://doi.org/10.1186/s13049-024-01266-2
2024-09-28
Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine
Abstract: Artificial intelligence (AI) chatbots are established as tools for answering medical questions worldwide. Healthcare trainees are increasingly using this cutting-edge technology, although its reliability and accuracy in the context of healthcare remain uncertain. This study evaluated the suitability of ChatGPT versions 3.5 and 4 for healthcare professionals seeking up-to-date evidence and recommendations for resuscitation. It compared the key messages of the resuscitation guidelines, which methodically set the gold standard of current evidence and recommendations, with the statements of the AI chatbots on this topic.
emergency medicine