Abstract:Background and objectives: Pulmonary embolism (PE) is a complex disease with high mortality and morbidity rate, leading to increasing society burden. However, current diagnosis is solely based on symptoms and laboratory data despite its complex pathology, which easily leads to misdiagnosis and missed diagnosis by inexperienced doctors. Especially, CT pulmonary angiography, the gold standard method, is not widely available. In this study, we aim to establish a rapid and accurate screening model for pulmonary embolism using machine learning technology. Importantly, data required for disease prediction are easily accessed, including routine laboratory data and medical record information of patients.Methods: We extracted features from patients' routine laboratory results and medical records, including blood routine, biochemical group, blood coagulation routine and other test results, as well as symptoms and medical history information. Samples with a feature loss rate greater than 0.8 were deleted from the original database. Data from 4723 cases were retained, 231 of which were positive for pulmonary embolism. 50 features were retained through the positive and negative statistical hypothesis testing which was used to build the predictive model. In order to avoid identification as majority-class samples caused by the imbalance of sample proportion, we used the method of Synthetic Minority Oversampling Technique (SMOTE) to increase the amount of information on minority samples. Five typical machine learning algorithms were used to model the screening of pulmonary embolism, including Support Vector Machines, Logistic Regression, Random Forest, XGBoost, and Back Propagation Neural Networks. To evaluate model performance, sensitivity, specificity and AUC curve were analyzed as the main evaluation indicators. Furthermore, a baseline model was established using the characteristics of the pulmonary embolism guidelines as a comparison model.Results: We found that XGBoost showed better performance compared to other models, with the highest sensitivity and specificity (0.99 and 0.99, respectively). Moreover, it showed significant improvement in performance compared to the baseline model (sensitivity and specificity were 0.76 and 0.76 respectively). More important, our model showed low missed diagnosis rate (0.46) and high AUC value (0.992). Finally, the calculation time of our model is only about 0.05 s to obtain the possibility of pulmonary embolism.Conclusions: In this study, five machine learning classification models were established to assess the likelihood of patients suffering from pulmonary embolism, and the XGBoost model most significantly improved the precision, sensitivity, and AUC for pulmonary embolism screening. Collectively, we have established an AI-based model to accurately predict pulmonary embolism at early stage.

Driverless artificial intelligence framework for the identification of malignant pleural effusion

Development and Validation of a Radiomics Nomogram for Diagnosis of Malignant Pleural Effusion.

Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data

Machine learning applied to near-infrared spectra for clinical pleural effusion classification

Diagnosis of malignant pleural effusion with combinations of multiple tumor markers: A comparison study of five machine learning models

Establishment of Machine Learning-Based Tool for Early Detection of Pulmonary Embolism

A multitask deep learning approach for pulmonary embolism detection and identification

Quantitative proteomics revealed protein biomarkers to distinguish malignant pleural effusion from benign pleural effusion

A simple and efficient clinical prediction scoring system to identify malignant pleural effusion

Performance and clinical utility of an artificial intelligence-enabled tool for pulmonary embolism detection

A retrospective study on the combined biomarkers and ratios in serum and pleural fluid to distinguish the multiple types of pleural effusion

Improved detection of small pulmonary embolism on unenhanced computed tomography using an artificial intelligence-based algorithm – a single centre retrospective study

Validation of machine learning algorithms for differentiating tuberculous from malignant pleural effusion.

Massive external validation of a machine learning algorithm to predict pulmonary embolism in hospitalized patients

A deep learning-based algorithm improves radiology residents' diagnoses of acute pulmonary embolism on CT pulmonary angiograms

APPARATUS FOR MOTOR CONDITIONING IN CATS.

An Integrated Clinical and Computerized Tomography-Based Radiomic Feature Model to Separate Benign from Malignant Pleural Effusion

Artificial intelligence based on deep learning for differential diagnosis between benign and malignant pulmonary nodules: A real-world, multicenter, diagnostic study.

Deep learning automated quantification of lung disease in pulmonary hypertension on CT pulmonary angiography: A preliminary clinical study with external validation

A machine learning model for diagnosing acute pulmonary embolism and comparison with Wells score, revised Geneva score, and Years algorithm

Abstract 12497: Pulmonary Embolism Mortality Prediction With Deep Learning Based on Computed Tomographic Pulmonary Angiography and Clinical Data