Adaptive Multiscale Retinal Diagnosis: A Hybrid Trio-Model Approach for Comprehensive Fundus Multi-Disease Detection Leveraging Transfer Learning and Siamese Networks

Yavuz Selim Inan
2024-05-28
Abstract:WHO has declared that more than 2.2 billion people worldwide are suffering from visual disorders, such as media haze, glaucoma, and drusen. At least 1 billion of these cases could have been either prevented or successfully treated, yet they remain unaddressed due to poverty, a lack of specialists, inaccurate ocular fundus diagnoses by ophthalmologists, or the presence of a rare disease. To address this, the research has developed the Hybrid Trio-Network Model Algorithm for accurately diagnosing 12 distinct common and rare eye diseases. This algorithm utilized the RFMiD dataset of 3,200 fundus images and the Binary Relevance Method to detect diseases separately, ensuring expandability and avoiding incorrect correlations. Each detector, incorporating finely tuned hyperparameters to optimize performance, consisted of three feature components: A classical transfer learning CNN model, a two-stage CNN model, and a Siamese Network. The diagnosis was made using features extracted through this Trio-Model with Ensembled Machine Learning algorithms. The proposed model achieved an average accuracy of 97% and an AUC score of 0.96. Compared to past benchmark studies, an increase of over 10% in the F1-score was observed for most diseases. Furthermore, using the Siamese Network, the model successfully made predictions in diseases like optic disc pallor, which past studies failed to predict due to low confidence. This diagnostic tool presents a stable, adaptive, cost-effective, efficient, accessible, and fast solution for globalizing early detection of both common and rare diseases.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the issue that globally, over 2.2 billion people suffer from visual impairments, of which at least 1 billion cases could be avoided through prevention or treatment. However, due to poverty, lack of specialists, inaccurate fundus diagnosis, and the presence of rare diseases, these cases have not been properly managed. To tackle this problem, the research developed a Hybrid Trio-Network Model Algorithm aimed at accurately diagnosing 12 common and rare eye diseases through fundus color photographs. Specifically, the model utilized 3200 fundus images from the RFMiD dataset and employed the Binary Relevance Method to detect various diseases separately, ensuring the model's scalability and avoiding erroneous correlations. Each detector comprises three feature components: a classical transfer learning CNN model, a two-stage CNN model, and a Siamese network. The features extracted by these components are combined with ensemble machine learning algorithms to enhance diagnostic accuracy. The proposed model demonstrated excellent performance in diagnosing multiple eye diseases, achieving an average accuracy of 97% and an AUC score of 0.96. Compared to previous research benchmarks, the F1 scores for most diseases improved by over 10%. Notably, for some hard-to-diagnose diseases such as Optic Disc Pallor, the model successfully made predictions, achieving an F1 score improvement from 0 to 0.55. In summary, the study proposes a stable, adaptable, cost-effective, efficient, and easily accessible solution aimed at achieving early detection of common and rare eye diseases on a global scale.