Abstract:Summary Background Accurately distinguishing between malignant and benign thyroid nodules through fine-needle aspiration cytopathology is crucial for appropriate therapeutic intervention. However, cytopathologic diagnosis is time consuming and hindered by the shortage of experienced cytopathologists. Reliable assistive tools could improve cytopathologic diagnosis efficiency and accuracy. We aimed to develop and test an artificial intelligence (AI)-assistive system for thyroid cytopathologic diagnosis according to the Thyroid Bethesda Reporting System. Methods 11 254 whole-slide images (WSIs) from 4037 patients were used to train deep learning models. Among the selected WSIs, cell level was manually annotated by cytopathologists according to The Bethesda System for Reporting Thyroid Cytopathology (TBSRTC) guidelines of the second edition (2017 version). A retrospective dataset of 5638 WSIs of 2914 patients from four medical centres was used for validation. 469 patients were recruited for the prospective study of the performance of AI models and their 537 thyroid nodule samples were used. Cohorts for training and validation were enrolled between Jan 1, 2016, and Aug 1, 2022, and the prospective dataset was recruited between Aug 1, 2022, and Jan 1, 2023. The performance of our AI models was estimated as the area under the receiver operating characteristic (AUROC), sensitivity, specificity, accuracy, positive predictive value, and negative predictive value. The primary outcomes were the prediction sensitivity and specificity of the model to assist cyto-diagnosis of thyroid nodules. Findings The AUROC of TBSRTC III+ (which distinguishes benign from TBSRTC classes III, IV, V, and VI) was 0·930 (95% CI 0·921–0·939) for Sun Yat-sen Memorial Hospital of Sun Yat-sen University (SYSMH) internal validation and 0·944 (0·929 – 0·959), 0·939 (0·924–0·955), 0·971 (0·938–1·000) for The First People's Hospital of Foshan (FPHF), Sichuan Cancer Hospital & Institute (SCHI), and The Third Affiliated Hospital of Guangzhou Medical University (TAHGMU) medical centres, respectively. The AUROC of TBSRTC V+ (which distinguishes benign from TBSRTC classes V and VI) was 0·990 (95% CI 0·986–0·995) for SYSMH internal validation and 0·988 (0·980–0·995), 0·965 (0·953–0·977), and 0·991 (0·972–1·000) for FPHF, SCHI, and TAHGMU medical centres, respectively. For the prospective study at SYSMH, the AUROC of TBSRTC III+ and TBSRTC V+ was 0·977 and 0·981, respectively. With the assistance of AI, the specificity of junior cytopathologists was boosted from 0·887 (95% CI 0·8440–0·922) to 0·993 (0·974–0·999) and the accuracy was improved from 0·877 (0·846–0·904) to 0·948 (0·926–0·965). 186 atypia of undetermined significance samples from 186 patients with BRAF mutation information were collected; 43 of them harbour the BRAF V600E mutation. 91% (39/43) of BRAF V600E -positive atypia of undetermined significance samples were identified as malignant by the AI models. Interpretation In this study, we developed an AI-assisted model named the Thyroid Patch-Oriented WSI Ensemble Recognition (ThyroPower) system, which facilitates rapid and robust cyto-diagnosis of thyroid nodules, potentially enhancing the diagnostic capabilities of cytopathologists. Moreover, it serves as a potential solution to mitigate the scarcity of cytopathologists. Funding Guangdong Science and Technology Department. Translation For the Chinese translation of the abstract see Supplementary Materials section.

Deep learning diagnostic performance and visual insights in differentiating benign and malignant thyroid nodules on ultrasound images

Ultrasound Image Classification of Thyroid Nodules Based on Deep Learning

Deep learning for classification of thyroid nodules on ultrasound: validation on an independent dataset

Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study

Artificial intelligence in thyroid ultrasound

Ultrasound-based deep learning using the VGGNet model for the differentiation of benign and malignant thyroid nodules: A meta-analysis

RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer

Diagnostic performance of artificial intelligence in interpreting thyroid nodules on ultrasound images: a multicenter retrospective study

Deep learning for the diagnosis of suspicious thyroid nodules based on multimodal ultrasound images

Deep learning models for thyroid nodules diagnosis of fine-needle aspiration biopsy: a retrospective, prospective, multicentre study in China

The Auxiliary Diagnosis of Thyroid Echogenic Foci Based on a Deep Learning Segmentation Model: A Two-Center Study

Evaluation of a Deep Learning-Based Computer-Aided Diagnosis System for Distinguishing Benign from Malignant Thyroid Nodules in Ultrasound Images.

Research on The Identification of Benign and Malignant Thyroid Nodule Ultrasound Images Based on Deep Learning Model

Thyroid Nodules Risk Stratification Through Deep Learning Based on Ultrasound Images

Deep Learning Based on ACR TI-RADS Can Improve the Differential Diagnosis of Thyroid Nodules

Incorporation of a Machine Learning Algorithm With Object Detection Within the Thyroid Imaging Reporting and Data System Improves the Diagnosis of Genetic Risk

Classification of Thyroid Nodules by Using Deep Learning Radiomics Based on Ultrasound Dynamic Video

Using Deep Neural Network to Diagnose Thyroid Nodules on Ultrasound in Patients With Hashimoto’s Thyroiditis

An Intelligent Platform for Ultrasound Diagnosis of Thyroid Nodules

Thyroid ultrasound image classification using a convolutional neural network

A Multi-View Deep Learning Model for Thyroid Nodules Detection and Characterization in Ultrasound Imaging