Construction and interpretation of machine learning-based prognostic models for survival prediction among intestinal-type and diffuse-type gastric cancer patients
Kunxiang Ji,Lei Shi,Yan Feng,Linna Wang,HuanNan Guo,Hui Li,Jiacheng Xing,Siyu Xia,Boran Xu,Eryu Liu,YanDan Zheng,Chunfeng Li,Mingyang Liu
DOI: https://doi.org/10.1186/s12957-024-03550-y
2024-10-15
Abstract:Background: Gastric cancer is one of the most common malignant tumors worldwide, with high incidence and mortality rates, and it has a complex etiology and complex pathological features. Depending on the tumor type, gastric cancer can be classified as intestinal-type and diffuse-type gastric cancer, each with distinct pathogenic mechanisms and clinical presentations. In recent years, machine learning techniques have been widely applied in the medical field, offering new perspectives for the diagnosis, treatment, and prognosis of gastric cancer patients. Methods: This study recruited 2158 gastric cancer patients and constructed prognostic prediction models for both intestinal-type and diffuse-type gastric cancer. Clinical pathological data were collected from patients, and machine learning algorithms were used for feature selection and model construction. The performance of the models was validated with training and testing datasets. The Shapley additive explanations (SHAP) values were used to interpret the model predictions and identify the main factors that influence patient survival. Results: In the prognostic model for intestinal-type gastric cancer, the gradient boosting decision tree (GBDT) model demonstrated the best performance, with key features including pTNM, CA125, tumor size, CA199, and PALB. Similarly, in the prognostic model for diffuse-type gastric cancer, the GBDT model was utilized, with key features comprising pTNM, Borrmann type IV disease, lymphocyte (LYM), lactate dehydrogenase (LDH), potassium (K), perineural invasion (PNI), tumor size, and whole stomach location. Risk stratification analysis revealed that the prognosis of high-risk patients was significantly worse than that of low-risk patients. Conclusion: Machine learning shows great potential in predicting survival outcomes of gastric cancer patients, providing strong support for the development of personalized treatment plans.