Applying a random projection algorithm to optimize machine learning model for predicting peritoneal metastasis in gastric cancer patients using CT images

Seyedehnafiseh Mirniaharikandehei,Morteza Heidari,Gopichandh Danala,Sivaramakrishnan Lakshmivarahan,Bin Zheng
DOI: https://doi.org/10.1016/j.cmpb.2021.105937
IF: 6.1
2021-03-01
Computer Methods and Programs in Biomedicine
Abstract:<h3 class="u-h4 u-margin-m-top u-margin-xs-bottom">Background and Objective</h3><p>Non-invasively predicting the risk of cancer metastasis before surgery can play an essential role in determining which patients can benefit from neoadjuvant chemotherapy. This study aims to investigate and test the advantages of applying a random projection algorithm to develop and optimize a radiomics-based machine learning model to predict peritoneal metastasis in gastric cancer patients using a small and imbalanced computed tomography (CT) image dataset.</p><h3 class="u-h4 u-margin-m-top u-margin-xs-bottom">Methods</h3><p>A retrospective dataset involving CT images acquired from 159 patients is assembled, including 121 and 38 cases with and without peritoneal metastasis, respectively. A computer-aided detection scheme is first applied to segment primary gastric tumor volumes and initially compute 315 image features. Then, five gradients boosting machine (GBM) models embedded with five feature selection methods (including random projection algorithm, principal component analysis, least absolute shrinkage, and selection operator, maximum relevance and minimum redundancy, and recursive feature elimination) along with a synthetic minority oversampling technique, are built to predict the risk of peritoneal metastasis. All GBM models are trained and tested using a leave-one-case-out cross-validation method.</p><h3 class="u-h4 u-margin-m-top u-margin-xs-bottom">Results</h3><p>Results show that the GBM model embedded with a random projection algorithm yields a significantly higher prediction accuracy (71.2%) than the other four GBM models (p&lt;0.05). The precision, sensitivity, and specificity of this optimal GBM model are 65.78%, 43.10%, and 87.12%, respectively.</p><h3 class="u-h4 u-margin-m-top u-margin-xs-bottom">Conclusions</h3><p>This study demonstrates that CT images of the primary gastric tumors contain discriminatory information to predict the risk of peritoneal metastasis, and a random projection algorithm is a promising method to generate optimal feature vector, improving the performance of machine learning based prediction models.</p>
engineering, biomedical,computer science, interdisciplinary applications,medical informatics, theory & methods
What problem does this paper attempt to address?