US-based radiomics analysis of different machine learning models for differentiating benign and malignant BI-RADS 4A breast lesions

Jieyi Ye,Yinting Chen,Jiawei Pan,Yide Qiu,Zhuoru Luo,Yue Xiong,Yanping He,Yingyu Chen,Fuqing Xie,Weijun Huang
DOI: https://doi.org/10.1016/j.acra.2024.08.024
2024-08-26
Abstract:Rationale and objectives: To investigate and authenticate the effectiveness of various radiomics models in distinguishing between benign and malignant BI-RADS 4A lesions. Methods: A total of 936 patients with pathologically confirmed 4A lesions were included in the study (training cohort: n = 655; test cohort: n = 281). Radiomic features were derived from greyscale US images. Following dimensionality reduction and feature selection, radiomics models were developed using logistic regression (LR), support vector machine (SVM), random forest (RF), eXtreme gradient boosting (XGBoost) and multilayer perceptron (MLP) algorithms. Univariate and multivariable logistic regression analyses were employed to investigate clinical-radiological characteristics and determine variables for creating a clinical model. Five combined models integrating radiomic and clinical parameters were constructed by using each algorithm, and comparison with radiologists' performance was performed. SHapley Additive exPlanations (SHAP) approach was used to elucidate the radiomic model by ranking the significance of features based on their contribution to the evaluation. Results: A total of 1561 radiomic features were extracted. Thirty-six features were deemed significant by dimensionality reduction and selection. The radiomic models showed good performance with AUCs of 0.829-0.945 in training cohort; and 0.805-0.857 in test cohort. The combined model developed by using LR showed the best performance (AUC, training cohort: 0.909; test cohort: 0.905), which is superior to radiologists' performance. Decision curve analysis (DCA) of this combined model indicated better clinical efficacy than clinical and radiomic models. Conclusions: The combined model integrating radiomic and clinical features demonstrated excellent performance in differentiating between benign and malignant 4A lesions. It may offer a non-invasive and efficient approach to aid in clinical decision-making.
What problem does this paper attempt to address?