Joint ordinal regression and multiclass classification for diabetic retinopathy grading with transformers and CNNs fusion network

Lei Ma,Qihang Xu,Hanyu Hong,Yu Shi,Ying Zhu,Lei Wang,Xu, Qihang,Hong, Hanyu,Shi, Yu
DOI: https://doi.org/10.1007/s10489-023-04949-y
IF: 5.3
2023-09-15
Applied Intelligence
Abstract:Diabetic retinopathy (DR) is a chronic complication of diabetes that damages the retinal blood vessels, leading to impaired vision and even blindness, and is one of the top three eye diseases causing human blindness. An effective DR grading algorithm can help ophthalmologists to diagnose patients and improve efficiency. Therefore, we propose a fusion network based on transformer and convolutional neural network (CNN) to perform DR grading. The proposed approach addresses two critical issues in the task: i ) The CNN-based DR classification method has a small receptive field, so the range of available information is limited. The Transformer-based DR classification method has a large receptive field, but it is easy to lose local details; ii ) Existing approaches treat DR grading as a traditional multiclass classification task, which ignores ordinal information between DR of different levels. To address i ), we fuse the multi-level features of CNN and Transformer at different stages, and realize the enhancement and interaction of local and global information through the fusion module. To address ii ), we propose a KC loss by formulating DR grading as a joint ordinal regression and multiclass classification problem, and obtain both category-supervised information and ordinal supervised information. Extensive qualitative and quantitative assessments have shown that our approach achieves superior performance on publicly available DeepDR and IDRiD datasets.
computer science, artificial intelligence
What problem does this paper attempt to address?