Abstract:Background Pulmonary tuberculosis (PTB) is a global health problem and remains the leading infectious cause of death worldwide. Differentiation between secondary PTB and non-tuberculous (non-TB) pneumonia is important for patient isolation and treatment, but can be difficult to determine clinically and radiologically. We proposed the application of deep learning to chest computerized tomography (CT) to assist doctors in detecting and differentiating PTB from non-TB pneumonia in an expedient, non-invasive, and reproducible manner. Methods: We retrospectively collected a dataset containing 1,124 CT scans from 923 PTB and non-TB pneumonia based on their pathological reports of lung biopsy and clinical information and 201 patients without pulmonary infiltrate from West China Hospital between 2012 and 2018. Randomly selected parts of this dataset (WCPR dataset) were used to develop, train, internally validate and test the algorithm. Patients in the WCPR dataset (PTB, n=439; non-TB pneumonia, n=484; normal, n= 201) were randomly assigned in three non-overlapping sets: training set, n=866; validation set, n=108; and test set, n=150. An additional dataset from NIH TB Portal31 comprising of cases from three countries (Belarus, n = 274; Romania, n = 43; Moldova, n = 10) was used to validate externally the algorithm's ability to identify PTB. A convolutional neural network of Inception-Res-Net-v230 was trained and tested on the entire chest CT to mimic real life application. The performance of our algorithm was compared to three trained radiology/pulmonology physicians. Findings For differentiating pulmonary infiltrates, the algorithm achieved 99·3% accuracy (149 out of 150), 99.0% sensitivity, and 100·0% specificity. For identifying PTB, the algorithm achieved 82·0% accuracy (123 out of 150), 95·9% sensitivity, and 75·2% specificity. For identifying non-TB pneumonia, the algorithm achieved 81·3% accuracy (122 out of 150), 52·7% sensitivity, and 97·9% specificity. This mostly outperformed our human readers for PTB identification, who averaged up to 81·1% accuracy, 70·8% sensitivity, and 86·1% specificity. Our algorithm identified 287 out of 327 PTB (87·8% accuracy) cases in NIH TB Portal Dataset from other countries. Interpretation: Our deep-learning-based algorithm successfully differentiated abnormal from normal chest CTs, as well as PTB from non-TB pneumonia cases and thus allows real world applicability. Early identification of PTB from non-TB pneumonia can help control outbreaks through isolation and early appropriate treatment. The application of our algorithm could expedite the identification of PTB, thereby improving clinical outcomes. Our datasets and algorithm used in this study will be publicly available to facilitate world-wide adoption. Funding Statement: The authors declare: None. Declaration of Interests: The authors declare: None. Ethics Approval Statement: This study was approved by the Institutional Review Board of West China Hospital (approval No. 2019-148) and Icahn School of Medicine at Mount Sinai (approval No. GCO#1: 19-0569(0001) ISMMS), and the patients’ written consents were waived.

Transformer-based deep learning model for the diagnosis of suspected lung cancer in primary care based on electronic health record data

AI-based approach to enable proactive identification of early lung cancer: A retrospective population health study and economic model.

[Epidemiological and clinical characteristics and prognostic factors of ovarian carcinoma].

Machine Learning and Real-World Data to Predict Lung Cancer Risk in Routine Care

Performance of a Machine Learning Algorithm Using Electronic Health Record Data to Identify and Estimate Survival in a Longitudinal Cohort of Patients With Lung Cancer

Validation of a Deep Learning-Based Model to Predict Lung Cancer Risk Using Chest Radiographs and Electronic Medical Record Data

Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification

Machine learning computational model to predict lung cancer using electronic medical records

Early detection of non-small cell lung cancer using electronic health record data

A Classifier for Improving Early Lung Cancer Diagnosis Incorporating Artificial Intelligence and Liquid Biopsy

Machine Learning for Early Discrimination Between Lung Cancer and Benign Nodules Using Routine Clinical and Laboratory Data

Development of Lung Cancer Risk Prediction Machine Learning Models for Equitable Learning Health System: Retrospective Study

Deep Learning Model for Pathological Grading and Prognostic Assessment of Lung Cancer Using CT Imaging: A Study on NLST and External Validation Cohorts

Towards radiologist-level cancer risk assessment in CT lung screening using deep learning

Pulmonologists-Level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach

Primary Care Datasets for Early Lung Cancer Detection: an AI Led Approach

A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models

A Generalized Deep Learning Approach for Evaluating Secondary Pulmonary Tuberculosis on Chest Computed Tomography

Deep Learning Predicts Lung Cancer Treatment Response from Serial Medical Imaging

Sybil: A Validated Deep Learning Model to Predict Future Lung Cancer Risk From a Single Low-Dose Chest Computed Tomography