Assessing the proficiency of artificial intelligence programs in the diagnosis and treatment of cornea, conjunctiva, and eyelid diseases and exploring the advantages of each other benefits

Eyupcan Sensoy, Mehmet Citirik
DOI: https://doi.org/10.1016/j.clae.2024.102125
IF: 3.946
2024-04-01
Contact Lens and Anterior Eye
Abstract: PURPOSE: To determine the knowledge level of the ChatGPT, Bing, and Bard artificial intelligence programs regarding corneal, conjunctival, and eyelid diseases and their treatment modalities, and to examine their reliability and their superiority over one another. METHODS: Forty-one questions related to corneal, conjunctival, and eyelid diseases and treatment modalities were posed to the ChatGPT, Bing, and Bard chatbots. The answers were compared with the answer keys and classified as correct or incorrect. Accuracy rates were compared. RESULTS: ChatGPT answered 51.2% of the questions correctly, Bing answered 53.7% correctly, and Bard answered 68.3% correctly. There was no significant difference among the three artificial intelligence chatbots in the rate of correct and incorrect answers (p = 0.208, Pearson's chi-square test). CONCLUSION: Although information about corneal, conjunctival, and eyelid diseases and treatment modalities can be accessed quickly and accurately using up-to-date artificial intelligence programs, the answers may not always be accurate and up-to-date. Care should be taken when evaluating this information.
ophthalmology
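
The comparison described in the abstract (three chatbots, correct versus incorrect answers, Pearson's chi-square test) can be sketched as follows. This is a minimal illustration, not the authors' analysis: the per-chatbot counts are an assumption, reconstructed by rounding each reported percentage of 41 questions, and the function names are hypothetical. Because the counts are reconstructed and the authors' software settings are unknown, the p-value computed here need not equal the reported 0.208, although it leads to the same conclusion of no significant difference at the 0.05 level.

```python
import math

TOTAL_QUESTIONS = 41  # from the abstract

# Accuracy rates reported in the abstract.
reported_accuracy = {"ChatGPT": 0.512, "Bing": 0.537, "Bard": 0.683}


def reconstruct_counts(accuracy, total):
    """Assumption: recover (correct, incorrect) counts by rounding."""
    correct = round(accuracy * total)
    return correct, total - correct


def pearson_chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


def p_value_two_dof(statistic):
    """Upper-tail probability of a chi-square variable with exactly 2 degrees
    of freedom, for which the survival function is exp(-x / 2)."""
    return math.exp(-statistic / 2)


# Rows: chatbots; columns: (correct, incorrect).
table = [list(reconstruct_counts(a, TOTAL_QUESTIONS))
         for a in reported_accuracy.values()]
statistic, dof = pearson_chi_square(table)
p_value = p_value_two_dof(statistic)  # valid here because dof == 2

print(f"table={table}, chi2={statistic:.3f}, dof={dof}, p={p_value:.3f}")
```

The closed-form p-value is used only to keep the sketch free of third-party dependencies; it applies solely to the 3 x 2 case, where the test has 2 degrees of freedom.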