Assessing the efficacy of 2D and 3D CNN algorithms in OCT-based glaucoma detection

Rafiul Karim Rasel,Fengze Wu,Marion Chiariglione,Stacey S. Choi,Nathan Doble,Xiaoyi Raymond Gao
DOI: https://doi.org/10.1038/s41598-024-62411-6
IF: 4.6
2024-05-25
Scientific Reports
Abstract:Glaucoma is a progressive neurodegenerative disease characterized by the gradual degeneration of retinal ganglion cells, leading to irreversible blindness worldwide. Therefore, timely and accurate diagnosis of glaucoma is crucial, enabling early intervention and facilitating effective disease management to mitigate further vision deterioration. The advent of optical coherence tomography (OCT) has marked a transformative era in ophthalmology, offering detailed visualization of the macula and optic nerve head (ONH) regions. In recent years, both 2D and 3D convolutional neural network (CNN) algorithms have been applied to OCT image analysis. While 2D CNNs rely on post-prediction aggregation of all B-scans within OCT volumes, 3D CNNs allow for direct glaucoma prediction from the OCT data. However, in the absence of extensively pre-trained 3D models, the comparative efficacy of 2D and 3D-CNN algorithms in detecting glaucoma from volumetric OCT images remains unclear. Therefore, this study explores the efficacy of glaucoma detection through volumetric OCT images using select state-of-the-art (SOTA) 2D-CNN models, 3D adaptations of these 2D-CNN models with specific weight transfer techniques, and a custom 5-layer 3D-CNN-Encoder algorithm. The performance across two distinct datasets is evaluated, each focusing on the macula and the ONH, to provide a comprehensive understanding of the models' capabilities in identifying glaucoma. Our findings demonstrate that the 2D-CNN algorithm consistently provided robust results compared to their 3D counterparts tested in this study for glaucoma detection, achieving AUC values of 0.960 and 0.943 for the macular and ONH OCT test images, respectively. Given the scarcity of pre-trained 3D models trained on extensive datasets, this comparative analysis underscores the overall utility of 2D and 3D-CNN algorithms in advancing glaucoma diagnostic systems in ophthalmology and highlights the potential of 2D algorithms for volumetric OCT image-based glaucoma detection.
multidisciplinary sciences
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to evaluate the efficacy of 2D and 3D Convolutional Neural Network (CNN) algorithms in detecting glaucoma based on Optical Coherence Tomography (OCT) images. Specifically, the researchers hope to determine which model performs better in detecting glaucoma by comparing these different types of CNN models, especially when using volumetric OCT image data. ### Background and Motivation Glaucoma is a progressive neurodegenerative disease, characterized by the gradual degeneration of retinal ganglion cells, which eventually leads to irreversible blindness. Therefore, timely and accurate diagnosis is crucial for early intervention and effective management of the disease to prevent further vision deterioration. The development of Optical Coherence Tomography (OCT) technology has brought revolutionary changes to ophthalmology, as it can provide detailed visual images of the macular area and the Optic Nerve Head (ONH) area. In recent years, 2D and 3D Convolutional Neural Network (CNN) algorithms have been widely used in OCT image analysis. 2D CNN relies on post - prediction aggregation of all B - scans within the OCT volume, while 3D CNN can directly predict glaucoma from OCT data. However, in the absence of widely pre - trained 3D models, the relative efficacy of 2D and 3D - CNN algorithms in detecting glaucoma is not clear. ### Research Objectives To fill this research gap, this study aims to evaluate the efficacy of these models in detecting glaucoma based on volumetric OCT images by using selected state - of - the - art (SOTA) 2D - CNN models, 3D - adapted 2D - CNN models (using a specific weight transfer technique), and a custom - made 5 - layer 3D - CNN - Encoder algorithm. The study conducted performance evaluations on two different datasets, one focusing on the macular area and the other on the optic nerve head, to fully understand the ability of these models to identify glaucoma. ### Main Findings The research results show that the 2D - CNN algorithm performs excellently in glaucoma detection, with an AUC value of 0.960 on OCT test images in the macular area and an AUC value of 0.943 on OCT test images in the optic nerve head. In contrast, the performance of the 3D - CNN model is slightly inferior. This may be due to the lack of large - scale pre - trained 3D models and the problem that 3D models are prone to overfitting when dealing with small - scale datasets. ### Conclusions Although the 3D - CNN model can utilize volumetric information, in the current study, the 2D - CNN model has shown stronger robustness and higher accuracy in glaucoma detection. Future research needs to further explore how to improve the performance of the 3D - CNN model in glaucoma detection, especially in the case of small - scale datasets. These findings provide an important reference for OCT image diagnosis in the field of ophthalmology and point out the direction for future research.