Abstract:Objective To evaluate the effectiveness and reasoning ability of ChatGPT in diagnosing retinal vascular diseases in the Chinese clinical environment. Materials and Methods We collected 1226 fundus fluorescein angiography reports and corresponding diagnosis written in Chinese, and tested ChatGPT with four prompting strategies (direct diagnosis or diagnosis with explanation and in Chinese or English). Results ChatGPT using English prompt for direct diagnosis achieved the best performance, with F1-score of 80.05%, which was inferior to ophthalmologists (89.35%) but close to ophthalmologist interns (82.69%). Although ChatGPT can derive reasoning process with a low error rate, mistakes such as misinformation (1.96%), and hallucination (0.59%) still exist. Discussion and Conclusions ChatGPT can serve as a helpful medical assistant to provide diagnosis under non-English clinical environments, but there are still performance gaps, language disparity, and errors compared to professionals, which demonstrates the potential limitations and the desiration to continually explore more robust LLMs in ophthalmology practice. ### Competing Interest Statement The authors have declared no competing interest. ### Funding Statement The work is supported by Natural Science Foundation of China (grant number: 82201195). ### Author Declarations I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes The details of the IRB/oversight body that provided approval or exemption for the research described are given below: Ethics committee/IRB of Second Affiliated Hospital, School of Medicine, Zhejiang University gave ethical approval for this work.(IRB:[NCT04718532][1]) I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes Data will be made available for research purposes upon request. Data requests are to be directed to jinkai{at}zju.edu.cn. [1]: /lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT04718532&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F14%2F2023.06.28.23291931.atom

Development and evaluation of a large language model of ophthalmology in Chinese

TCMChat: A Generative Large Language Model for Traditional Chinese Medicine

Medical, moral and legal aspects of renal replacement therapy.

Evaluating Large Language Models in Ophthalmology

Evaluating multiple large language models in pediatric ophthalmology

EyeGPT: Ophthalmic Assistant with Large Language Models

Uncovering Language Disparity of ChatGPT in Healthcare: Non-English Clinical Environment for Retinal Vascular Disease Classification (Preprint)

Ophtha-LLaMA2: A Large Language Model for Ophthalmology

A case of IgE multiple myeloma

Large Language Models Leverage External Knowledge to Extend Clinical Insight Beyond Language Boundaries

OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue

A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

Review of emerging trends and projection of future developments in large language models research in ophthalmology

A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation

Uncovering Language Disparity of ChatGPT in Healthcare: Non-English Clinical Environment for Retinal Vascular Disease Classification

OphGLM: An ophthalmology large language-and-vision assistant

Improving Clinical Expertise in Large Language Models Using Electronic Medical Records

DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities

Source and risk assessment of PCBs in sediments of Fenhe reservoir and watershed, China.

Clinical application potential of large language model: a study based on thyroid nodules

Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots in Ophthalmology and LLM-based evaluation using GPT-4