Assessing the efficacy of 2D and 3D CNN algorithms in OCT-based glaucoma detection

Rafiul Karim Rasel, Fengze Wu, Marion Chiariglione, Stacey S. Choi, Nathan Doble, Xiaoyi Raymond Gao

DOI: https://doi.org/10.1038/s41598-024-62411-6

IF: 4.6

2024-05-25

Scientific Reports

Abstract: Glaucoma is a progressive neurodegenerative disease characterized by the gradual degeneration of retinal ganglion cells, leading to irreversible blindness worldwide. Therefore, timely and accurate diagnosis of glaucoma is crucial, enabling early intervention and facilitating effective disease management to mitigate further vision deterioration. The advent of optical coherence tomography (OCT) has marked a transformative era in ophthalmology, offering detailed visualization of the macula and optic nerve head (ONH) regions. In recent years, both 2D and 3D convolutional neural network (CNN) algorithms have been applied to OCT image analysis. While 2D CNNs rely on post-prediction aggregation of all B-scans within OCT volumes, 3D CNNs allow for direct glaucoma prediction from the OCT data. However, in the absence of extensively pre-trained 3D models, the comparative efficacy of 2D and 3D-CNN algorithms in detecting glaucoma from volumetric OCT images remains unclear. Therefore, this study explores the efficacy of glaucoma detection through volumetric OCT images using select state-of-the-art (SOTA) 2D-CNN models, 3D adaptations of these 2D-CNN models with specific weight transfer techniques, and a custom 5-layer 3D-CNN-Encoder algorithm. The performance across two distinct datasets is evaluated, each focusing on the macula and the ONH, to provide a comprehensive understanding of the models' capabilities in identifying glaucoma. Our findings demonstrate that the 2D-CNN algorithm consistently provided robust results compared to their 3D counterparts tested in this study for glaucoma detection, achieving AUC values of 0.960 and 0.943 for the macular and ONH OCT test images, respectively. Given the scarcity of pre-trained 3D models trained on extensive datasets, this comparative analysis underscores the overall utility of 2D and 3D-CNN algorithms in advancing glaucoma diagnostic systems in ophthalmology and highlights the potential of 2D algorithms for volumetric OCT image-based glaucoma detection.

multidisciplinary sciences

What problem does this paper attempt to address?

This paper evaluates the efficacy of 2D and 3D Convolutional Neural Network (CNN) algorithms in detecting glaucoma from Optical Coherence Tomography (OCT) images. Specifically, the researchers compare these types of CNN models to determine which performs better at glaucoma detection on volumetric OCT image data.

### Background and Motivation

Glaucoma is a progressive neurodegenerative disease characterized by the gradual degeneration of retinal ganglion cells, which eventually leads to irreversible blindness. Timely and accurate diagnosis is therefore crucial for early intervention and effective management of the disease to prevent further vision deterioration. The development of OCT has brought revolutionary changes to ophthalmology, as it provides detailed images of the macular area and the Optic Nerve Head (ONH) area. In recent years, both 2D and 3D CNN algorithms have been applied to OCT image analysis. 2D CNNs rely on post-prediction aggregation of all B-scans within the OCT volume, while 3D CNNs can predict glaucoma directly from the OCT data. However, in the absence of extensively pre-trained 3D models, the relative efficacy of 2D and 3D-CNN algorithms in detecting glaucoma is not clear.

### Research Objectives

To fill this research gap, the study evaluates glaucoma detection from volumetric OCT images using selected state-of-the-art (SOTA) 2D-CNN models, 3D adaptations of these 2D-CNN models (using specific weight transfer techniques), and a custom 5-layer 3D-CNN-Encoder algorithm. Performance was evaluated on two different datasets, one focusing on the macular area and the other on the optic nerve head, to fully understand the ability of these models to identify glaucoma.
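
The summary does not say which weight transfer technique the authors used. As a rough illustration only, the sketch below shows one commonly used option, kernel "inflation", in which each pre-trained 2D kernel is repeated along a new depth axis and rescaled. The function name `inflate_kernel_2d_to_3d`, the weight layout `(out, in, kH, kW)`, and the random stand-in weights are assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def inflate_kernel_2d_to_3d(w2d: np.ndarray, depth: int) -> np.ndarray:
    """Inflate 2D conv weights (out, in, kH, kW) to 3D (out, in, depth, kH, kW).

    Hypothetical helper: each 2D kernel is copied `depth` times along a new
    axis and divided by `depth`, so summing over the depth axis recovers the
    original 2D kernel.
    """
    if w2d.ndim != 4:
        raise ValueError("expected weights shaped (out, in, kH, kW)")
    if depth < 1:
        raise ValueError("depth must be a positive integer")
    # Insert the depth axis, repeat along it, then rescale.
    w3d = np.repeat(w2d[:, :, np.newaxis, :, :], depth, axis=2)
    return w3d / depth


# Random stand-in for a pre-trained 2D layer (8 filters, 3 input channels, 3x3).
rng = np.random.default_rng(0)
w2d = rng.normal(size=(8, 3, 3, 3))
w3d = inflate_kernel_2d_to_3d(w2d, depth=3)
print(w3d.shape)
```

The rescaling means that a volume made of identical slices produces the same response from the inflated kernel as a single slice does from the 2D kernel, which is the usual motivation for this kind of initialization.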

### Main Findings

The results show that the 2D-CNN algorithm performs strongly in glaucoma detection, with an AUC of 0.960 on macular OCT test images and an AUC of 0.943 on optic nerve head OCT test images. In contrast, the performance of the 3D-CNN models is slightly inferior. This may be due to the lack of large-scale pre-trained 3D models and to the tendency of 3D models to overfit on small datasets.

### Conclusions

Although the 3D-CNN models can use volumetric information, in the current study the 2D-CNN model showed stronger robustness and higher accuracy in glaucoma detection. Future research needs to explore how to improve the performance of 3D-CNN models in glaucoma detection, especially on small datasets. These findings provide a reference for OCT-based diagnosis in ophthalmology and point out directions for future research.
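
To make the 2D pipeline and the reported metric concrete, the following sketch aggregates per-B-scan probabilities into one score per OCT volume and then computes a volume-level AUC. The aggregation rule (a plain mean), the helper names `aggregate_volume` and `roc_auc`, and the toy probabilities are assumptions for illustration; the summary does not state how the authors aggregated B-scan predictions.

```python
from statistics import mean


def aggregate_volume(bscan_probs):
    """Combine per-B-scan glaucoma probabilities into one volume score.

    Assumption: a simple mean; other rules (max, majority vote) are possible.
    """
    if not bscan_probs:
        raise ValueError("a volume needs at least one B-scan prediction")
    return mean(bscan_probs)


def roc_auc(labels, scores):
    """AUC as the chance a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data: volume id -> (per-B-scan probabilities, label; 1 = glaucoma).
volumes = {
    "vol_a": ([0.9, 0.8, 0.7], 1),
    "vol_b": ([0.2, 0.4, 0.3], 0),
    "vol_c": ([0.6, 0.5, 0.4], 1),
    "vol_d": ([0.5, 0.6, 0.7], 0),
}
labels = [label for _, label in volumes.values()]
scores = [aggregate_volume(probs) for probs, _ in volumes.values()]
auc = roc_auc(labels, scores)
print(auc)  # 0.75: three of the four positive/negative pairs are ranked correctly
```

Because AUC depends only on how volumes are ranked, any monotonic rescaling of the aggregated scores leaves it unchanged, while a different aggregation rule can change the ranking and therefore the AUC.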


Deep Learning for Glaucoma Detection and Identification of Novel Diagnostic Areas in Diverse Real-World Datasets

A feature agnostic approach for glaucoma detection in OCT volumes

Finding New Diagnostic Information for Detecting Glaucoma using Neural Networks

Are Macula or Optic Nerve Head Structures better at Diagnosing Glaucoma? An Answer using AI and Wide-Field Optical Coherence Tomography

Advancing glaucoma detection with convolutional neural networks: a paradigm shift in ophthalmology

Geometric Deep Learning to Identify the Critical 3D Structural Features of the Optic Nerve Head for Glaucoma Diagnosis

Medical Application of Geometric Deep Learning for the Diagnosis of Glaucoma

Three-Dimensional Multi-Task Deep Learning Model to Detect Glaucomatous Optic Neuropathy and Myopic Features From Optical Coherence Tomography Scans: A Retrospective Multi-Centre Study

Are Macula or Optic Nerve Head Structures Better at Diagnosing Glaucoma? An Answer Using Artificial Intelligence and Wide-Field Optical Coherence Tomography

Automatic Diagnosis of Macular Diseases from OCT Volume Based on Its Two-Dimensional Feature Map and Convolutional Neural Network with Attention Mechanism

A combined convolutional and recurrent neural network for enhanced glaucoma detection

Advancing Diabetic Retinopathy Diagnosis: Leveraging Optical Coherence Tomography Imaging with Convolutional Neural Networks

Sparse Dissimilarity-Constrained Coding for Glaucoma Screening

Optimizing Ocular Pathology Classification with CNNs and OCT Imaging: A Systematic and Performance Review

Diagnosis of glaucoma from retinal fundus images using disc localization and sequential DNN model

Hybrid deep learning models for the screening of Diabetic Macular Edema in optical coherence tomography volumes

Comparison of CNN Algorithms for Feature Extraction on Fundus Images to Detect Glaucoma

Deep-Learning–Aided Diagnosis of Diabetic Retinopathy, Age-Related Macular Degeneration, and Glaucoma Based on Structural and Angiographic OCT

A Comparative Study on Deep Convolutional Neural Networks and Histogram Equalization Techniques for Glaucoma Detection From Fundus Images