Abstract:
Background: Glaucoma is one of the major causes of blindness; it is estimated that over 110 million people worldwide will be affected by glaucoma by 2040. Research on glaucoma detection using deep learning has been increasing, but diagnosing glaucoma in a large population with a high incidence of myopia remains a challenge. This study aimed to provide a decision support system for the automatic detection of glaucoma from fundus images that can be applied to general screening, especially in areas with a high incidence of myopia.
Methods: A total of 1,155 fundus images were acquired from 667 individuals with a mean axial length of 25.60 ± 2.0 mm at the National Taiwan University Hospital, Hsinchu Branch. Based on the findings of complete ophthalmology examinations, visual field tests, and optical coherence tomography, the images were graded into three groups: normal (N, n = 596), pre-perimetric glaucoma (PPG, n = 66), and glaucoma (G, n = 493). They were then divided into a training-validation set (N: 476, PPG: 55, G: 373) and a test set (N: 120, PPG: 11, G: 120). A multimodal model was applied that used the Xception model for image feature extraction together with machine learning algorithms [random forest (RF), support vector machine (SVM), dense neural network (DNN), and others].
Results: The Xception model classified the N, PPG, and G groups with a micro-average area under the receiver operating characteristic curve (AUROC) of 93.9% under tenfold cross-validation. Although sensitivity for the normal and glaucoma groups reached 93.51% and 86.13%, respectively, sensitivity for PPG was only 30.27%. The AUROC increased to 96.4% when classifying N + PPG versus G. For the multimodal model with the N + PPG versus G grouping, the AUROCs of RF, SVM, and DNN were 99.56%, 99.59%, and 99.10%, respectively; the N versus PPG + G grouping differed by less than 1%. On the test set, AUROCs were 3%-5% lower overall than the validation results.
Conclusion: The multimodal model achieved a high AUROC in detecting glaucoma in a population with a high incidence of myopia. The model shows potential for general automatic screening and telemedicine, especially in Asia.
Trial registration: The study was approved by the Institutional Review Board of the National Taiwan University Hospital, Hsinchu Branch (no. NTUHHCB 108-025-E).
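
The abstract reports a micro-average AUROC over the three groups and a separate AUROC after merging groups into a binary task. The sketch below illustrates how those two quantities are commonly computed; it is not the authors' code. The function names, the choice to score a merged group by summing its class probabilities, and the toy probabilities are all assumptions made for illustration, and only the standard library is used.

```python
from typing import Sequence

CLASSES = ("N", "PPG", "G")  # group labels as named in the abstract


def binary_auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a positive outscores a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def micro_average_auroc(true_classes, prob_rows, classes=CLASSES) -> float:
    """Pool every (sample, class) pair into one binary problem, then compute AUROC."""
    flat_labels, flat_scores = [], []
    for true_class, probs in zip(true_classes, prob_rows):
        for k, cls in enumerate(classes):
            flat_labels.append(1 if cls == true_class else 0)
            flat_scores.append(probs[k])
    return binary_auroc(flat_labels, flat_scores)


def merged_group_auroc(true_classes, prob_rows, positive, classes=CLASSES) -> float:
    """Binary AUROC after merging classes, e.g. positive=("G",) for N + PPG versus G.

    Assumption: the score of the merged positive group is the sum of its
    class probabilities.
    """
    idx = [k for k, cls in enumerate(classes) if cls in positive]
    labels = [1 if t in positive else 0 for t in true_classes]
    scores = [sum(probs[k] for k in idx) for probs in prob_rows]
    return binary_auroc(labels, scores)


# Toy example: invented probabilities for four images, ordered as (N, PPG, G).
y_true = ["N", "G", "PPG", "G"]
y_prob = [
    [0.8, 0.1, 0.1],
    [0.1, 0.2, 0.7],
    [0.5, 0.3, 0.2],  # a PPG image scored mostly as normal
    [0.2, 0.1, 0.7],
]

micro = micro_average_auroc(y_true, y_prob)
n_ppg_vs_g = merged_group_auroc(y_true, y_prob, positive=("G",))
print(f"micro-average AUROC: {micro:.4f}")
print(f"N + PPG vs G AUROC:  {n_ppg_vs_g:.4f}")
```

In this toy data the PPG image is scored mostly as normal, which lowers the three-class micro-average, while the merged N + PPG versus G task is unaffected. This mirrors, in miniature, why a low PPG sensitivity can coexist with a higher binary AUROC.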

Multimodal LLMs for Retinal Disease Diagnosis via OCT: Few-Shot vs Single-Shot Learning

The Role of Prompt Engineering for Multimodal LLM Glaucoma Diagnosis

Visual-Textual Integration in LLMs for Medical Diagnosis: A Quantitative Analysis

Evaluating LLM-Generated Multimodal Diagnosis from Medical Images and Symptom Analysis

Evaluating the strengths and limitations of multimodal ChatGPT-4 in detecting glaucoma using fundus images

Evaluating General Vision-Language Models for Clinical Medicine

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models

OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue

Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis

A 360° View for Large Language Models: Early Detection of Amblyopia in Children using Multi-View Eye Movement Recordings

Multimodal Deep Learning Classifier for Primary Open Angle Glaucoma Diagnosis Using Wide-Field Optic Nerve Head Cube Scans in Eyes With and Without High Myopia

Use of multimodal dataset in AI for detecting glaucoma based on fundus photographs assessed with OCT: focus group study on high prevalence of myopia

Large Language Models in Ophthalmology: Potential and Pitfalls

Geometric Correspondence-Based Multimodal Learning for Ophthalmic Image Analysis

A fusion of deep neural networks and game theory for retinal disease diagnosis with OCT images

OphGLM: An ophthalmology large language-and-vision assistant

An Early Investigation into the Utility of Multimodal Large Language Models in Medical Imaging

The possibility of the combination of OCT and fundus images for improving the diagnostic accuracy of deep learning for age-related macular degeneration: a preliminary experiment

EyeGPT: Ophthalmic Assistant with Large Language Models

Optimizing Ocular Pathology Classification with CNNs and OCT Imaging: A Systematic and Performance Review