Predicting the Functional Effects of Human Non-Coding Variants Based on Stacking Ensemble Learning.

Zepeng Liu,Xiao-Tai Huang,Kei Hang Katie Chan,Lin Gao
DOI: https://doi.org/10.1109/bibm55620.2022.9995273
2022-01-01
Abstract:Predicting the functional impact of genetic variants in non-coding regions of the human genome can aid in the elucidation of the etiology of diseases or traits. In recent years, an increasing number of methods to predict the impact of sequence variation in non-coding regions of the human genome have been developed. However, most of current studies are limited to predict specific types of non-coding variants. To address this problem, here we propose a non-coding SNVs prediction method based on stacking integration strategy. The method consists of three stacking models built using the same strategy based on different causality assumptions to predict functional, pathogenic, and cancer driver non-coding SNVs, respectively. We demonstrate that our method outperforms the other seven methods. In addition, a comparison of our proposed model with other methods for non-coding de novo mutations in autism spectrum disease reveals that our model has the highest discriminative ability, indicating that its performance is stable and superior in different scenarios.
What problem does this paper attempt to address?