S461 AI Language Models vs Medical Guidelines: A Comparative Study of ChatGPT Models in Advising Colonoscopy Follow-Up Intervals

Akash Patel,Adewale Ajumobi
DOI: https://doi.org/10.14309/01.ajg.0001031212.41441.a8
2024-10-26
The American Journal of Gastroenterology
Abstract:The application of large language models (LLMs) in healthcare has been a growing area of interest, particularly in providing patient-specific recommendations. This study evaluates the accuracy of different LLMs in recommending colonoscopy follow-up intervals based on the United States Multi-Society Task Force on Colorectal Cancer (USMSTF) 2020 guidelines. We compared ChatGPT 4 Omni, ChatGPT 4, and ChatGPT 3.5 to determine which model aligns most closely with the 2020 USMSTF guidelines.
gastroenterology & hepatology
What problem does this paper attempt to address?