Multi-modality Network Based on CGAN and Attention Mechanism for Glaucoma Grading.

Ling Liu,Yuanyuan Peng,Dehui Xiang,Fei Shi,Xinjian Chen
DOI: https://doi.org/10.1117/12.2654113
2023-01-01
Abstract:Glaucoma is a progressive optic neuropathy characterized by changes in the structure of the optic nerve head and visual field,which is one of the major irreversible blinding eye diseases worldwide. Early screening and timely diagnosis of glaucoma is of significant importance. Fundus color photography and optical coherence tomography (OCT) are the two most effective imaging modalities for glaucoma screening, where significant ocular structural changes, such as vertical cup-to-disc ratio (vCDR) on fundus images and retinal nerve fiber layer (RNFL) thickness on OCT volumes, can be present with both imaging modalities. In recent years, multi-modal deep learning methods have shown great advantages in image classification and segmentation tasks. In this paper, we propose a multi-modal glaucoma grading network with two main contributions: (1) To address the inherent shortage of multi-modal training data, conditional generative adversarial network (CGAN) is used to generate more synthetic images, extending the dataset over the only available dataset. (2) A multi-modality cross-attention (MMCA) module is proposed to further improve the classification accuracy.
What problem does this paper attempt to address?