Development and Validation of a Deep Learning Model to Predict the Occurrence and Severity of Retinopathy of Prematurity
Qiaowei Wu,Yijun Hu,Zhenyao Mo,Rong Wu,Xiayin Zhang,Yahan Yang,Baoyi Liu,Yu Xiao,Xiaomin Zeng,Zhanjie Lin,Ying Fang,Yijin Wang,Xiaohe Lu,Yanping Song,Wing W. Y. Ng,Songfu Feng,Honghua Yu
DOI: https://doi.org/10.1001/jamanetworkopen.2022.17447
2022-01-01
JAMA Network Open
Abstract:Importance:Retinopathy of prematurity (ROP) is the leading cause of childhood blindness worldwide. Prediction of ROP before onset holds great promise for reducing the risk of blindness.Objective:To develop and validate a deep learning (DL) system to predict the occurrence and severity of ROP before 45 weeks' postmenstrual age.Design, Setting, and Participants:This retrospective prognostic study included 7033 retinal photographs of 725 infants in the training set and 763 retinal photographs of 90 infants in the external validation set, along with 46 characteristics for each infant. All images of both eyes from the same infant taken at the first screening were labeled according to the final diagnosis made between the first screening and 45 weeks' postmenstrual age. The DL system was developed using retinal photographs from the first ROP screening and clinical characteristics before or at the first screening in infants born between June 3, 2017, and August 28, 2019.Exposures:Two models were specifically designed for predictions of the occurrence (occurrence network [OC-Net]) and severity (severity network [SE-Net]) of ROP. Five-fold cross-validation was applied for internal validation.Main Outcomes and Measures:Area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, and specificity to evaluate the performance in ROP prediction.Results:This study included 815 infants (450 [55.2%] boys) with mean birth weight of 1.91 kg (95% CI, 1.87-1.95 kg) and mean gestational age of 33.1 weeks (95% CI, 32.9-33.3 weeks). In internal validation, mean AUC, accuracy, sensitivity, and specificity were 0.90 (95% CI, 0.88-0.92), 52.8% (95% CI, 49.2%-56.4%), 100% (95% CI, 97.4%-100%), and 37.8% (95% CI, 33.7%-42.1%), respectively, for OC-Net to predict ROP occurrence and 0.87 (95% CI, 0.82-0.91), 68.0% (95% CI, 61.2%-74.8%), 100% (95% CI, 93.2%-100%), and 46.6% (95% CI, 37.3%-56.0%), respectively, for SE-Net to predict severe ROP. In external validation, the AUC, accuracy, sensitivity, and specificity were 0.94, 33.3%, 100%, and 7.5%, respectively, for OC-Net, and 0.88, 56.0%, 100%, and 35.3%, respectively, for SE-Net.Conclusions and Relevance:In this study, the DL system achieved promising accuracy in ROP prediction. This DL system is potentially useful in identifying infants with high risk of developing ROP.