Supervised Learning Based Systemic Inflammatory Markers Enable Accurate Additional Surgery for pT1NxM0 Colorectal Cancer: A Comparative Analysis of Two Practical Prediction Models for Lymph Node Metastasis
Jinlian Jin,Haiyan Zhou,Shulin Sun,Zhe Tian,Haibing Ren,Jinwu Feng
DOI: https://doi.org/10.2147/CMAR.S337516
2021-12-02
Cancer Management and Research
Abstract:
Department of Gastroenterology, The Third Clinical Medical College of China Three Gorges University, Gezhouba Central Hospital of Sinopharm, Yichang, Hubei, 443002, People's Republic of China. Correspondence: Jinlian Jin, Department of Gastroenterology, The Third Clinical Medical College of China Three Gorges University, Gezhouba Central Hospital of Sinopharm, No. 60, Qiaohu 1st Road, Xiling District, Yichang, Hubei, 443002, People's Republic of China. Tel +8613986746553
Purpose: Predicting lymph node metastasis (LNM) after endoscopic resection is crucial in determining whether patients with pT1NxM0 colorectal cancer (CRC) should undergo additional surgery. This study aimed to develop a predictive model that can reduce the current likelihood of overtreatment.
Patients and Methods: We recruited a total of 1194 consecutive patients with pT1NxM0 CRC who underwent endoscopic or surgical resection at the Gezhouba Central Hospital of Sinopharm between January 1, 2006, and August 31, 2021. A random forest classifier (RFC) and a generalized linear model (GLM) were each used to screen for the variables that most strongly affected LNM prediction. The area under the curve (AUC) and decision curve analysis (DCA) were applied to assess the accuracy of the predictive models.
Results: The analysis identified the top 10 candidate factors: depth of submucosal invasion, neutrophil-lymphocyte ratio (NLR), platelet-lymphocyte ratio (PLR), platelet-to-neutrophil ratio (PNR), venous invasion, poorly differentiated clusters, tumor budding, grade, lymphatic vascular invasion, and background adenoma. The GLM achieved an AUC of 0.79 (95% confidence interval [CI]: 0.30 to 1.28) in the training cohort and an AUC of 0.80 (95% CI: 0.36 to 1.24) in the validation cohort.
Meanwhile, the RFC exhibited an AUC of 0.84 (95% CI: 0.40 to 1.28) in the training cohort and an AUC of 0.85 (95% CI: 0.41 to 1.29) in the validation cohort. DCAs also showed that the RFC had superior predictive ability.
Conclusion: Our supervised learning-based model incorporating histopathologic parameters and inflammatory markers predicted LNM more accurately than the GLM. This new supervised learning-based predictive model can be used to determine an individually tailored treatment strategy.
Keywords: colorectal cancer, pT1NxM0, lymph node metastasis, prediction model, machine learning, random forest classifier, generalized linear model
CRC is the third most common malignant tumor and carries a high mortality rate. 1,2 Metastasis is the main cause of cancer-related death. 3 According to current literature reports, even patients with pT1NxM0 CRC have an estimated LNM risk of 10% to 15%. 4,5 Colonoscopy remains the gold standard for detecting and resecting precancerous colorectal lesions, but it cannot determine the status of the regional lymph nodes. Endoscopic resection is now accepted as a curative therapy for CRC because it is minimally invasive. 6,7 Additional surgical resection after endoscopic resection in patients with CRC can achieve complete staging and reduce the recurrence rate. 8 However, endoscopic resection of pT1NxM0 CRC should be used selectively because of the high risk of LNM. 9 Therefore, the remaining two-thirds of patients may face the risks of surgical resection and the related postoperative mortality. 10 In addition, unnecessary surgical resection brings no clinical benefit. Because LNM cannot currently be predicted preoperatively, it is difficult to decide whether additional surgery is needed after endoscopic resection of pT1NxM0 CRC.
Given this situation, there is a pressing need for methods to determine whether patients with pT1NxM0 CRC should undergo additional surgery. Supervised learning (SL) is a branch of artificial intelligence that encapsulates statistical and iterative algorithms to support fact querying and complex decision-making. 11,12 In addition, SL analysis is more effective than the traditional logistic linear regression (LLR) statistical method and can optimize variable screening. 13 Therefore, combining SL analysis with medical records to predict LNM in the early monitoring of patients with pT1NxM0 CRC is worth exploring. In this study, we aimed to develop an LNM risk prediction model for pT1NxM0 CRC that utilizes clinical medical data to stratify patients by LNM risk after endoscopic resec -Abstract Truncated-
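The abstract names three inflammatory ratios (NLR, PLR, PNR) and two evaluation tools (AUC and DCA) without showing how they are computed. The following is a minimal illustrative sketch in plain Python, not the authors' code: the function names, the toy labels and scores, and the 0.3 threshold are all assumptions made for the example. The ratios follow directly from their names, the AUC is computed as the probability that a randomly chosen LNM-positive case scores higher than a randomly chosen negative case, and the net benefit uses the standard decision-curve definition TP/n - (FP/n) * pt/(1 - pt).

```python
from typing import Dict, Sequence


def inflammatory_ratios(neutrophils: float, lymphocytes: float, platelets: float) -> Dict[str, float]:
    """Return NLR, PLR, and PNR from blood counts given in the same units."""
    return {
        "NLR": neutrophils / lymphocytes,  # neutrophil-lymphocyte ratio
        "PLR": platelets / lymphocytes,    # platelet-lymphocyte ratio
        "PNR": platelets / neutrophils,    # platelet-to-neutrophil ratio
    }


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def net_benefit(y_true: Sequence[int], scores: Sequence[float], threshold: float) -> float:
    """Decision-curve net benefit at one threshold probability pt (0 < pt < 1)."""
    n = len(y_true)
    tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
    return tp / n - (fp / n) * threshold / (1 - threshold)


# Hypothetical toy data: 1 = LNM present, scores = predicted LNM probability.
y = [0, 0, 1, 1, 0, 1]
p = [0.10, 0.40, 0.35, 0.80, 0.20, 0.70]

ratios = inflammatory_ratios(neutrophils=4.0, lymphocytes=2.0, platelets=250.0)
model_auc = auc(y, p)
model_nb = net_benefit(y, p, threshold=0.3)
```

A decision curve is obtained by evaluating net_benefit over a range of thresholds and comparing the result with the "treat all" and "treat none" strategies; a model is considered clinically useful at thresholds where its net benefit exceeds both.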
oncology