HBS–STACK: hierarchical biomarker selection and stacked ensemble model for biomarker identification and cancer prediction on multi-omics
Arwinder Dhillon, Ashima Singh, Vinod Kumar Bhalla
DOI: https://doi.org/10.1007/s00521-023-09359-2
2024-01-05
Neural Computing and Applications
Abstract: Advances in genomic and transcriptomic data have opened new prospects for biomarker identification and cancer prediction. However, existing biomarker discovery and cancer diagnosis techniques struggle to capture the complex, nonlinear associations in biological datasets. Machine learning offers considerable potential for building feature selection techniques and models that identify cancer biomarkers. In this article, we propose a Hierarchical Biomarker Selection and Stacked Ensemble model for Biomarker Identification and Cancer Prediction (HBS–STACK) on miRNA, gene expression, and DNA methylation (DM) datasets. We develop a three-stage biomarker selection procedure: stage 1 aggregates information between CpG sites and genes based on their biological relations, stage 2 applies Fold Change and False Discovery Rate selection, and stage 3 applies Light Gradient Boosting Machine with Recursive Feature Elimination (LGBMRFE) selection. The selected features and markers are integrated and passed to a stacked ML model comprising Gradient Boosting Machine (GBM), Naïve Bayes (NB), and Random Forest (RF) at level 1 and a deep neural network (DNN) at level 2. HBS–STACK is evaluated on breast cancer (BRCA) and validated on kidney renal clear cell carcinoma (KIRC) data from The Cancer Genome Atlas (TCGA) portal and on Alzheimer's disease data. We identified several genomic and transcriptomic biomarkers, including IQSEC1 for BRCA; ZFHX3, CTBP2, and SLC9AR2 for KIRC; and TMEM61 for Alzheimer's disease. The experimental results show that HBS–STACK outperforms GBM, NB, and RF, achieving 99.60%, 99.03%, and 92.05% accuracy on BRCA, KIRC, and Alzheimer's disease, respectively, an improvement of 2.27%, 26.03%, and 10.05% over existing techniques.
computer science, artificial intelligence
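Stage 2 of the biomarker selection described in the abstract filters features by Fold Change and False Discovery Rate. The abstract does not give the thresholds, the test statistic, or the multiple-testing correction, so the following is only a minimal sketch under stated assumptions: a log2 fold change on group means with a pseudocount, the Benjamini-Hochberg procedure for the FDR, and illustrative cutoffs (`min_abs_log2fc=1.0`, `max_fdr=0.05`). The function names and the feature names in the usage example are hypothetical, and the p-values are assumed to come from a per-feature test run beforehand.

```python
import math


def log2_fold_change(mean_case, mean_control, pseudocount=1.0):
    """log2 ratio of group means; the pseudocount (an assumption) avoids division by zero."""
    return math.log2((mean_case + pseudocount) / (mean_control + pseudocount))


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values stay monotone: q at rank r is the minimum of p * m / k over k >= r.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values


def select_features(names, mean_case, mean_control, p_values,
                    min_abs_log2fc=1.0, max_fdr=0.05):
    """Keep features passing both the fold-change and the FDR cutoff (assumed values)."""
    q_values = benjamini_hochberg(p_values)
    selected = []
    for name, case, control, q in zip(names, mean_case, mean_control, q_values):
        lfc = log2_fold_change(case, control)
        if abs(lfc) >= min_abs_log2fc and q <= max_fdr:
            selected.append((name, lfc, q))
    return selected


# Made-up toy values, not data from the paper.
names = ["feat_a", "feat_b", "feat_c", "feat_d"]
mean_case = [7.0, 15.0, 3.0, 10.0]
mean_control = [1.0, 15.0, 15.0, 9.0]
p_values = [0.001, 0.8, 0.01, 0.03]

for name, lfc, q in select_features(names, mean_case, mean_control, p_values):
    print(f"{name}: log2FC={lfc:.2f}, q={q:.3f}")
```

In this toy input, `feat_d` is significant after correction but changes too little between groups, and `feat_b` fails both criteria, so only `feat_a` and `feat_c` are kept. Requiring both criteria is a common convention for differential analysis; whether HBS–STACK combines them in exactly this way is not stated in the abstract.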