Adaptive ensembling of multi-modal deep spatial representations for diabetic retinopathy diagnosis

Veeranjaneyulu N, Jyostna Devi Bodapati
DOI: https://doi.org/10.1007/s11042-024-18356-z
IF: 2.577
2024-01-29
Multimedia Tools and Applications
Abstract: Diabetic Retinopathy (DR) stands as one of the most prevalent complications among individuals with diabetes, potentially resulting in irreversible vision loss. Recent efforts within the research community have focused on developing Computer-Aided Diagnosis tools, harnessing color fundus retinal scan images to automate the assessment of diabetic retinopathy severity grades. Leveraging the latest advancements in Computer Vision and Neural Networks, these solutions have demonstrated impressive accuracy in identifying the early stages of retinopathy. However, their performance diminishes when faced with higher severity grades, likely due to the scarcity of labeled data for such cases. This study aims to address this limitation by delving into deep spatial representations derived from color fundus retinal scan images associated with higher severity grades. Different from the existing approaches, we exploit deep spatial representations extracted from a diverse set of pre-trained deep convolutional neural networks to craft an Adaptive Ensemble Classifier. This novel methodology excels at accurately classifying the severity grades of diabetic retinopathy. Our experiments, conducted on the Kaggle APTOS-2019 benchmark dataset, illustrate the superiority of multi-modal deep spatial representations when utilized in conjunction with the Adaptive Ensemble Classifier, by achieving 81.86% accuracy and surpassing the performance of hand-crafted and uni-modal representations for retinal scan images. These findings offer a promising stride towards enhancing the accuracy of diabetic retinopathy diagnosis, particularly in the context of more advanced severity grades.
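The abstract does not spell out how the Adaptive Ensemble Classifier combines the per-network representations, so the following is only an illustrative sketch of one common way to ensemble classifiers built on different feature sets. It assumes each pre-trained network already has its own classifier producing class probabilities over five severity grades, and it weights those classifiers by a softmax over their validation accuracies. The function names, the `temperature` parameter, the accuracy values, and the random stand-in probabilities are all hypothetical and are not taken from the paper.

```python
import numpy as np


def softmax(z):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def adaptive_ensemble(probs_per_model, val_accuracies, temperature=0.1):
    """Combine per-model class probabilities into one prediction per sample.

    probs_per_model: list of arrays, each of shape (n_samples, n_classes)
    val_accuracies:  one validation accuracy per model (assumed to be known)
    temperature:     hypothetical knob; smaller values favour the best model more
    """
    acc = np.asarray(val_accuracies, dtype=float)
    weights = np.exp(acc / temperature)
    weights /= weights.sum()                      # weights sum to 1
    stacked = np.stack(probs_per_model)           # (n_models, n_samples, n_classes)
    combined = np.tensordot(weights, stacked, axes=1)  # (n_samples, n_classes)
    return combined.argmax(axis=1), weights


# Stand-in data: random probabilities in place of real classifier outputs.
rng = np.random.default_rng(0)
n_samples, n_classes = 6, 5                       # five DR severity grades
probs = [softmax(rng.normal(size=(n_samples, n_classes))) for _ in range(3)]
val_acc = [0.74, 0.79, 0.77]                      # made-up validation accuracies

grades, weights = adaptive_ensemble(probs, val_acc)
print("model weights:", weights)
print("predicted grades:", grades)
```

In this sketch the weighting only adapts to validation accuracy; a scheme that adapts per class or per sample would need a different weighting rule.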
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering