Abstract

Objective: To evaluate the effectiveness and reasoning ability of ChatGPT in diagnosing retinal vascular diseases in the Chinese clinical environment.

Materials and Methods: We collected 1226 fundus fluorescein angiography reports and the corresponding diagnoses, all written in Chinese, and tested ChatGPT with four prompting strategies (direct diagnosis or diagnosis with explanation, each in Chinese or English).

Results: ChatGPT using an English prompt for direct diagnosis achieved the best performance, with an F1-score of 80.05%, which was inferior to ophthalmologists (89.35%) but close to ophthalmologist interns (82.69%). Although ChatGPT can produce a reasoning process with a low error rate, mistakes such as misinformation (1.96%) and hallucination (0.59%) still occur.

Discussion and Conclusions: ChatGPT can serve as a helpful medical assistant for providing diagnoses in non-English clinical environments, but performance gaps, language disparity, and errors remain compared with professionals. These findings demonstrate its potential limitations and the need to continue exploring more robust LLMs in ophthalmology practice.

### Competing Interest Statement

The authors have declared no competing interest.

### Funding Statement

The work is supported by the Natural Science Foundation of China (grant number: 82201195).

### Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The Ethics committee/IRB of the Second Affiliated Hospital, School of Medicine, Zhejiang University gave ethical approval for this work (IRB: NCT04718532).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes

Data will be made available for research purposes upon request. Data requests are to be directed to jinkai{at}zju.edu.cn.
