Identifying the key components in ResNet-50 for diabetic retinopathy grading from fundus images: a systematic investigation

Yijin Huang,Li Lin,Pujin Cheng,Junyan Lyu,Roger Tam,Xiaoying Tang

DOI: https://doi.org/10.48550/arXiv.2110.14160

2022-10-18

Abstract: Although deep learning based diabetic retinopathy (DR) classification methods typically benefit from well-designed architectures of convolutional neural networks, the training setting also has a non-negligible impact on the prediction performance. The training setting includes various interdependent components, such as objective function, data sampling strategy and data augmentation approach. To identify the key components in a standard deep learning framework (ResNet-50) for DR grading, we systematically analyze the impact of several major components. Extensive experiments are conducted on a publicly-available dataset EyePACS. We demonstrate that (1) the DR grading framework is sensitive to input resolution, objective function, and composition of data augmentation, (2) using mean square error as the loss function can effectively improve the performance with respect to a task-specific evaluation metric, namely the quadratically-weighted Kappa, (3) utilizing eye pairs boosts the performance of DR grading and (4) using data resampling to address the problem of imbalanced data distribution in EyePACS hurts the performance. Based on these observations and an optimal combination of the investigated components, our framework, without any specialized network design, achieves the state-of-the-art result (0.8631 for Kappa) on the EyePACS test set (a total of 42670 fundus images) with only image-level labels. We also examine the proposed training practices on other fundus datasets and other network architectures to evaluate their generalizability. Our codes and pre-trained model are available at https://github.com/YijinHuang/pytorch-classification.

Image and Video Processing, Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

This paper asks which components of the training setup matter most for diabetic retinopathy (DR) grading when the network itself is a standard ResNet-50. The authors systematically vary several major components, including input resolution, loss function, data augmentation, learning rate adjustment strategy, and data sampling strategy, and measure how each one affects grading performance. Through extensive experiments on the publicly available EyePACS dataset, they report the following major findings:

1. **Input Resolution**: The DR grading framework is highly sensitive to input resolution, with higher resolutions significantly improving performance.
2. **Loss Function**: Using Mean Squared Error (MSE) as the loss function effectively improves performance on the task-specific evaluation metric, the quadratically-weighted Kappa.
3. **Data Augmentation**: The composition of data augmentation operations has a significant impact on performance.
4. **Data Sampling Strategy**: Utilizing eye pairs boosts DR grading performance, whereas resampling the data to address class imbalance in EyePACS hurts performance.

Based on these observations and the best combination of the investigated components, the framework achieves a state-of-the-art result on the EyePACS test set (Kappa of 0.8631) using only image-level labels and no specialized network design. The authors also apply the same training practices to other fundus datasets and other network architectures to evaluate their generalizability.
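To make the link between the MSE loss and the evaluation metric concrete, the following is a minimal NumPy sketch of the quadratically-weighted Kappa for five DR grades (0 to 4). It is an illustration under stated assumptions, not the authors' code: the function names `quadratic_weighted_kappa` and `scores_to_grades` are hypothetical, and rounding a continuous regression output to the nearest grade is an assumed decoding rule that the text above does not specify.

```python
import numpy as np


def quadratic_weighted_kappa(y_true, y_pred, num_classes=5):
    """Quadratically-weighted Cohen's Kappa for integer grades 0..num_classes-1."""
    y_true = np.asarray(y_true, dtype=int)
    y_pred = np.asarray(y_pred, dtype=int)

    # Observed agreement: confusion matrix of true grade (row) vs predicted grade (column).
    observed = np.zeros((num_classes, num_classes), dtype=float)
    np.add.at(observed, (y_true, y_pred), 1)

    # Expected agreement under independence: outer product of the two marginals,
    # scaled so that it sums to the same total as the observed matrix.
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / observed.sum()

    # Penalty grows with the squared distance between grades, normalized to [0, 1].
    grades = np.arange(num_classes)
    weights = (grades[:, None] - grades[None, :]) ** 2 / (num_classes - 1) ** 2

    # Undefined (division by zero) if all labels and predictions share one grade.
    return 1.0 - (weights * observed).sum() / (weights * expected).sum()


def scores_to_grades(scores, num_classes=5):
    """Assumed decoding: round a continuous score to the nearest valid grade."""
    return np.clip(np.rint(scores), 0, num_classes - 1).astype(int)


if __name__ == "__main__":
    labels = np.array([0, 0, 1, 2, 3, 4, 2, 0])
    # Hypothetical regression outputs from a model trained with an MSE loss.
    scores = np.array([0.2, 0.6, 1.1, 2.4, 2.7, 3.8, 1.9, -0.1])
    preds = scores_to_grades(scores)
    print("predicted grades:", preds)
    print("quadratic weighted kappa:", quadratic_weighted_kappa(labels, preds))
```

Both the metric and an MSE loss penalize a prediction by the squared distance between the predicted and true grade, so confusing grade 0 with grade 4 costs far more than confusing grade 0 with grade 1. This shared structure is one plausible reading of why MSE training helps on this metric, whereas a standard cross-entropy loss treats all misclassifications alike.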

Grading of Diabetic Retinopathy Images Based on Graph Neural Network.

Deep attentive convolutional neural network for automatic grading of imbalanced diabetic retinopathy in retinal fundus images

Grouping and Decoupling Mechanism for Diabetic Retinopathy Image Grading.

End-to-end diabetic retinopathy grading based on fundus fluorescein angiography images using deep learning

Fundus Image Generation and Classification of Diabetic Retinopathy Based on Convolutional Neural Network

Early detection of anthracycline cardiotoxicity in children with acute leukemia using exercise-based echocardiography and Doppler echocardiography.

Robust Collaborative Learning of Patch-Level and Image-Level Annotations for Diabetic Retinopathy Grading From Fundus Image

Deep Learning of Diabetic Retinopathy Classification in Fundus Images

Diabetic Retinopathy Classification With Deep Learning via Fundus Images: A Short Survey

An Attention-Based Swin U-Net-Based Segmentation and Hybrid Deep Learning Based Diabetic Retinopathy Classification Framework Using Fundus Images

Deep convolutional neural networks for diabetic retinopathy detection by image classification

Classification of Diabetic Retinopathy Severity in Fundus Images with DenseNet121 and ResNet50

Development of preprocessing methods and revised EfficientNet for diabetic retinopathy detection

A ResNet is All You Need? Modeling A Strong Baseline for Detecting Referable Diabetic Retinopathy in Fundus Images

Recognition of Diabetic Retinopathy Grades Based on Data Augmentation and Attention Mechanisms

Coarse-to-fine classification for diabetic retinopathy grading using convolutional neural network

Deep learning based computer-aided automatic prediction and grading system for diabetic retinopathy

Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image

Attention-Driven Cascaded Network for Diabetic Retinopathy Grading from Fundus Images

Diagnosis and detection of diabetic retinopathy based on transfer learning