CT and CEA‐based Machine Learning Model for Predicting Malignant Pulmonary Nodules

Man Liu,Zhigang Zhou,Fenghui Liu,Meng Wang,Yulin Wang,Mengyu Gao,Huifang Sun,Xue Zhang,Ting Yang,Longtao Ji,Jiaqi Li,Qiufang Si,Liping Dai,Songyun Ouyang
DOI: https://doi.org/10.1111/cas.15561
IF: 5.7
2022-09-04
Cancer Science
Abstract:Computed tomography (CT), an efficient radiological technology, is used to detect lung cancer in clinic. Carcinoembryonic antigen (CEA), a common tumor biomarker, is applied in the detection of various tumors. To highlight the advantages of the two‐dimensional techniques and assist the clinicians in optimizing lung cancer diagnostic scheme, we established a favourable model combined CT and CEA. In the study, univariate analysis was performed to screen independent predictors in training cohort of 271 patients with malignant pulmonary nodules (MPNs) and 92 with benign pulmonary nodules (BPNs). Six machine learning‐based models involved 5 CT predictors (mediastinal lymph node enlargement, lobulation, vascular notch sign, spiculation, nodule number) and lnCEA were constructed and validated in an independent cohort of 129 participants (92 MPNs and 37 BPNs) by SPSS Modeler. Nomogram production and Delong test were generated by R software. Finally, the model established by logistic regression owned highest diagnostic efficiency (AUC=0.912). Moreover, the diagnostic ability of logistic model in the validation cohort (AUC=0.882, 80.4% sensitivity, 75.7% specificity) was higher than Peking University model (AUC=0.712, 68.5% sensitivity, 70.3% specificity) and Mayo model (AUC=0.745, 62.0% sensitivity, 75.7% specificity). Interestingly, for the participants with indeterminate nodule (10‐30 mm) and CEA‐negative, the model reached an AUC of 0.835 (72.3% sensitivity, 83.3% specificity). The AUC for the early lung cancer was as high as 0.822 with 67.3% sensitivity and 78.9% specificity. As a conclusion, the promising model presents a new diagnostic strategy for the clinic to distinguish MPNs from BPNs.
oncology
What problem does this paper attempt to address?