Deep learning-based image quality assessment for optical coherence tomography macular scans: a multicentre study
Ziqi Tang, Xi Wang, An Ran Ran, Dawei Yang, Anni Ling, Jason C Yam, Xiujuan Zhang, Simon K H Szeto, Jason Chan, Cherie Y K Wong, Vivian W K Hui, Carmen K M Chan, Tien Yin Wong, Ching-Yu Cheng, Charumathi Sabanayagam, Yih Chung Tham, Gerald Liew, Giridhar Anantharaman, Rajiv Raman, Yu Cai, Haoxuan Che, Luyang Luo, Quande Liu, Yiu Lun Wong, Amanda K Y Ngai, Vincent L Yuen, Nelson Kei, Timothy Y Y Lai, Hao Chen, Clement C Tham, Pheng-Ann Heng, Carol Y Cheung
DOI: https://doi.org/10.1136/bjo-2023-323871
2024-07-20
British Journal of Ophthalmology
Abstract: Aims To develop and externally test deep learning (DL) models for assessing the image quality of three-dimensional (3D) macular scans from Cirrus and Spectralis optical coherence tomography devices. Methods We retrospectively collected two data sets, comprising 2277 Cirrus 3D scans and 1557 Spectralis 3D scans, from electronic medical and research records at The Chinese University of Hong Kong Eye Centre and the Hong Kong Eye Hospital. Each data set was divided for training (70%), fine-tuning (10%) and internal validation (20%). Scans with various eye diseases (eg, diabetic macular oedema, age-related macular degeneration, polypoidal choroidal vasculopathy and pathological myopia) and scans of normal eyes from adults and children were included. Two graders labelled each 3D scan as gradable or ungradable according to standardised criteria. We used a 3D version of the residual network (ResNet)-18 for Cirrus 3D scans and a multiple-instance learning pipeline with ResNet-18 for Spectralis 3D scans. The two DL models were further tested on three unseen Cirrus data sets from Singapore and five unseen Spectralis data sets from India, Australia and Hong Kong, respectively. Results In the internal validation, the models achieved areas under the curve (AUCs) of 0.930 (0.885–0.976) and 0.906 (0.863–0.948) for assessing the Cirrus 3D scans and Spectralis 3D scans, respectively. In the external testing, the models showed robust performance, with AUCs ranging from 0.832 (0.730–0.934) to 0.930 (0.906–0.953) for the Cirrus data sets and from 0.891 (0.836–0.945) to 0.962 (0.918–1.000) for the Spectralis data sets. Conclusions Our models could be used to filter out ungradable 3D scans and could be combined with a disease-detection DL model, allowing a fully automated eye disease detection workflow.
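As a rough illustration of the 70%/10%/20% partition described in the Methods, the following minimal Python sketch divides a list of scan identifiers into training, fine-tuning and internal validation subsets. The function name `split_scans`, the random shuffle, the fixed seed and the rounding rule are all assumptions made for illustration; the abstract does not state how the split was drawn, what the exact subset sizes were, or whether scans were grouped by patient. A scan-level split is shown only for simplicity.

```python
import random


def split_scans(scan_ids, fractions=(0.7, 0.1, 0.2), seed=0):
    """Shuffle scan IDs and split them into train / fine-tune / validation lists.

    Hypothetical helper: the shuffle, seed and rounding are assumptions,
    not details reported in the abstract.
    """
    ids = list(scan_ids)
    random.Random(seed).shuffle(ids)  # deterministic shuffle for repeatability

    n_train = round(fractions[0] * len(ids))
    n_tune = round(fractions[1] * len(ids))

    train = ids[:n_train]
    tune = ids[n_train:n_train + n_tune]
    val = ids[n_train + n_tune:]  # remainder goes to internal validation
    return train, tune, val


# Data set sizes taken from the abstract; integer IDs stand in for real scans.
cirrus = split_scans(range(2277))
spectralis = split_scans(range(1557))

print("Cirrus subset sizes:", [len(part) for part in cirrus])
print("Spectralis subset sizes:", [len(part) for part in spectralis])
```

In practice, a split of this kind is often made at the patient level so that scans from the same eye or person do not appear in more than one subset; whether that was done here is not stated in the abstract.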
ophthalmology