Deep Learning Analysis with Gray Scale and Doppler Ultrasonography Images to Differentiate Graves’ Disease

Han-Sang Baek, Jinyoung Kim, Chaiho Jeong, Jeongmin Lee, Jeonghoon Ha, Kwanhoon Jo, Min Hee Kim, Tae Seo Sohn, Ihn Suk Lee, Jong Min Lee, Dong-Jun Lim
DOI: https://doi.org/10.1210/clinem/dgae254
2024-04-13
The Journal of Clinical Endocrinology & Metabolism
Abstract:
Context: Thyrotoxicosis requires accurate and expeditious differentiation between Graves’ disease (GD) and thyroiditis to ensure effective treatment decisions.
Objective: This study aimed to develop a machine learning algorithm using ultrasonography and Doppler images to differentiate thyrotoxicosis subtypes, with a focus on GD.
Methods: This study included patients who initially presented with thyrotoxicosis and underwent thyroid ultrasonography at a single tertiary hospital. A total of 7,719 ultrasonography images from 351 patients with GD and 2,980 images from 136 patients with thyroiditis were used. Data augmentation techniques were applied to enhance the algorithm’s performance. Two deep learning models, Xception and EfficientNetB0_2, were employed. Accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1 score were calculated for both models. Image pre-processing, neural network model generation, and verification of the neural network training results were performed using the DEEP:PHI® platform.
Results: The Xception model achieved 84.94% accuracy, 89.26% sensitivity, 73.17% specificity, 90.06% PPV, 71.43% NPV, and an F1 score of 89.66 for the diagnosis of GD. The EfficientNetB0_2 model exhibited 85.31% accuracy, 90.28% sensitivity, 71.78% specificity, 89.71% PPV, 73.05% NPV, and an F1 score of 89.99.
Conclusion: Machine learning models based on ultrasound and Doppler images showed promising results, with high accuracy and sensitivity in differentiating GD from thyroiditis.
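As an illustration of how the six reported metrics relate to one another, the following minimal Python sketch computes them from a binary confusion matrix, treating GD as the positive class. The function names and the example counts are hypothetical and are not taken from the paper, which does not report its confusion matrices in the abstract. The final lines only check that the reported F1 scores agree with the standard definition (harmonic mean of PPV and sensitivity) applied to the reported PPV and sensitivity values.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard diagnostic metrics from confusion-matrix counts.

    tp/fn: positive (e.g. GD) cases classified correctly/incorrectly.
    tn/fp: negative (e.g. thyroiditis) cases classified correctly/incorrectly.
    """
    sensitivity = tp / (tp + fn)   # recall on the positive class
    specificity = tn / (tn + fp)   # recall on the negative class
    ppv = tp / (tp + fp)           # precision
    npv = tn / (tn + fn)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "f1": f1_from_ppv_sensitivity(ppv, sensitivity),
    }


def f1_from_ppv_sensitivity(ppv, sensitivity):
    """F1 score: harmonic mean of PPV (precision) and sensitivity (recall)."""
    return 2 * ppv * sensitivity / (ppv + sensitivity)


# Hypothetical counts, chosen only to demonstrate the formulas.
example = binary_metrics(tp=90, fp=10, tn=30, fn=10)

# Consistency check against the values reported in the abstract.
xception_f1 = f1_from_ppv_sensitivity(0.9006, 0.8926)
efficientnet_f1 = f1_from_ppv_sensitivity(0.8971, 0.9028)
print(round(xception_f1 * 100, 2), round(efficientnet_f1 * 100, 2))
```

Because GD images outnumber thyroiditis images by roughly 2.6 to 1 (7,719 versus 2,980), accuracy alone can look favorable even when specificity is modest, which is one reason to read sensitivity, specificity, PPV, and NPV together.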
endocrinology & metabolism