The quality and readability of patient information provided by ChatGPT: can AI reliably explain common ENT operations?

Michel Abou-Abdallah, Talib Dar, Yasamin Mahmudzade, Joshua Michaels, Rishi Talwar, Chrysostomos Tornari
DOI: https://doi.org/10.1007/s00405-024-08598-w
2024-03-26
European Archives of Oto-Rhino-Laryngology
Abstract:
Purpose: Access to high-quality and comprehensible patient information is crucial. However, information provided by increasingly prevalent Artificial Intelligence tools has not been thoroughly investigated. This study assesses the quality and readability of information from ChatGPT regarding three index ENT operations: tonsillectomy, adenoidectomy, and grommets.
Methods: We asked ChatGPT standard and simplified questions. Readability was calculated using Flesch-Kincaid Reading Ease Score (FRES), Flesch-Kincaid Grade Level (FKGL), Gunning Fog Index (GFI) and Simple Measure of Gobbledygook (SMOG) scores. We assessed quality using the DISCERN instrument and compared these with ENT UK patient leaflets.
Results: ChatGPT readability was poor, with mean FRES of 38.9 and 55.1 pre- and post-simplification, respectively. Simplified information from ChatGPT was 43.6% more readable (FRES) but scored 11.6% lower for quality. ENT UK patient information readability and quality were consistently higher.
Conclusions: ChatGPT can simplify information at the expense of quality, resulting in shorter answers with important omissions. Limitations in knowledge and insight curb its reliability for healthcare information. Patients should use reputable sources from professional organisations alongside clear communication with their clinicians for well-informed consent and decision-making.
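The four readability scores named in the Methods are simple functions of word, sentence, and syllable counts. The abstract does not say which tool the authors used to compute them, so the following is only a minimal sketch using the standard published formulas. The function names are illustrative, and the vowel-group syllable counter is a crude assumption that will not match dictionary-based tools exactly.

```python
import math
import re


def fres(words, sentences, syllables):
    """Flesch Reading Ease: higher scores mean easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def fkgl(words, sentences, syllables):
    """Flesch-Kincaid Grade Level: approximate US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def gunning_fog(words, sentences, complex_words):
    """Gunning Fog Index; complex_words = words with 3+ syllables."""
    return 0.4 * ((words / sentences) + 100 * (complex_words / words))


def smog(sentences, polysyllables):
    """SMOG grade; polysyllables = words with 3+ syllables."""
    return 1.0430 * math.sqrt(polysyllables * (30 / sentences)) + 3.1291


def count_syllables(word):
    """Assumption: approximate syllables as runs of consecutive vowels."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_counts(text):
    """Return (words, sentences, syllables, polysyllables) for a text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    per_word = [count_syllables(w) for w in words]
    polysyllables = sum(1 for n in per_word if n >= 3)
    return len(words), len(sentences), sum(per_word), polysyllables


if __name__ == "__main__":
    # Hypothetical sample sentence, not taken from the study's data.
    sample = "The tonsils are removed under general anaesthetic. Most children go home the same day."
    w, s, syl, poly = text_counts(sample)
    print(f"FRES {fres(w, s, syl):.1f}, FKGL {fkgl(w, s, syl):.1f}, "
          f"GFI {gunning_fog(w, s, poly):.1f}, SMOG {smog(s, poly):.1f}")
```

On Flesch's conventional scale, FRES values of 30 to 50 are read as "difficult" and 50 to 60 as "fairly difficult", which places the reported means of 38.9 and 55.1 in those two bands respectively.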
otorhinolaryngology