Abstract: Objective: Breast cancer has become the most prevalent malignant tumor in women, and distant metastasis signifies a poor prognosis. Predictive models offer a novel approach to forecasting distant metastasis in breast cancer. This study aims to use readily available clinical data and advanced machine learning algorithms to establish an accurate clinical prediction model that provides effective decision support for clinicians. Methods: Data from 239 patients at two centers were analyzed, focusing on clinical blood biomarkers (tumor markers, liver and kidney function, lipid profile, and cardiovascular markers). Spearman correlation and least absolute shrinkage and selection operator (LASSO) regression were employed for feature dimension reduction. A predictive model was built using LightGBM and evaluated in training, internal test, and external validation cohorts. Feature importance correlation analysis was conducted on the clinical model and the comprehensive model, followed by univariate and multivariate regression analysis of these features. Results: We constructed a LightGBM model to predict de novo bone metastasis in newly diagnosed breast cancer patients and validated it internally and externally. The area under the receiver operating characteristic curve (AUC) of this model in the training, internal test, and external validation cohorts was 0.945, 0.892, and 0.908, respectively. Our validation results indicate that the model exhibits high sensitivity, specificity, and accuracy, making it a highly accurate model for predicting bone metastasis in breast cancer patients. Carcinoembryonic antigen, creatine kinase, albumin-globulin ratio, apolipoprotein B, and cancer antigen 153 (CA153) play crucial roles in the model's predictions. Lipoprotein(a), CA153, gamma-glutamyl transferase, α-hydroxybutyrate dehydrogenase, alkaline phosphatase, and creatine kinase are positively correlated with breast cancer bone metastasis, while white blood cell ratio and total cholesterol are negatively correlated. Conclusion: This study used clinical blood biomarkers to construct an artificial intelligence model that predicts distant metastasis in breast cancer with high accuracy, which suggests potential clinical utility in predicting and identifying distant metastasis. These findings underscore the prospect of developing economical and readily accessible predictive tools in clinical oncology.
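Two of the steps named in the Methods and Results, Spearman-based feature reduction and AUC evaluation, can be illustrated with a minimal sketch in plain Python. This is not the study's code: the 0.9 correlation threshold, the feature names, and all toy values below are assumptions made for illustration, and the LASSO and LightGBM stages are not reproduced here.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def drop_redundant(features, threshold=0.9):
    """Keep a feature only if |rho| with every already-kept feature is
    below the threshold (the threshold value is an assumption)."""
    kept = []
    for name, values in features.items():
        if all(abs(spearman(values, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept


def roc_auc(labels, scores):
    """AUC via the rank-sum (Mann-Whitney) identity: the probability that
    a random positive case scores above a random negative case."""
    ranks = average_ranks(scores)
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


# Hypothetical toy data, not taken from the study.
toy_features = {
    "cea": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "cea_rescaled": [10.0, 20.0, 30.0, 40.0, 50.0, 60.0],  # monotone copy of cea
    "ck": [3.0, 1.0, 6.0, 2.0, 5.0, 4.0],
}
toy_labels = [0, 0, 1, 0, 1, 1]                 # 1 = bone metastasis (made up)
toy_scores = [0.1, 0.4, 0.35, 0.2, 0.8, 0.9]    # model outputs (made up)

kept_features = drop_redundant(toy_features)
toy_auc = roc_auc(toy_labels, toy_scores)
print(kept_features, round(toy_auc, 3))
```

In this toy run the rescaled copy of the first feature is dropped because its rank correlation with the original is 1, while the weakly correlated third feature is kept. In a real pipeline the surviving features would then pass to LASSO and the classifier.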

Machine learning-based prediction model for distant metastasis of breast cancer

Development and validation of an artificial intelligence model for predicting de novo distant bone metastasis in breast cancer: a dual-center study

Development and validation of AI models using LR and LightGBM for predicting distant metastasis in breast cancer: a dual-center study
