A Novel Probability Model for LncRNA–Disease Association Prediction Based on the Naïve Bayesian Classifier

Jingwen Yu, Pengyao Ping, Lei Wang, Linai Kuang, Xueyong Li, Zhelun Wu
DOI: https://doi.org/10.3390/genes9070345
IF: 4.141
2018-07-08
Genes
Abstract: An increasing number of studies have indicated that long non-coding RNAs (lncRNAs) play crucial roles in biological processes and in the diagnosis, prognosis, and treatment of complex diseases. However, experimentally validated associations between lncRNAs and diseases are still very limited. Recently, computational models have been developed to discover potential associations between lncRNAs and diseases by integrating multiple heterogeneous biological data; this has become a hot topic in biological research. In this article, we constructed a global tripartite network by integrating a variety of biological information, including miRNA–disease, miRNA–lncRNA, and lncRNA–disease associations and interactions. Then, we constructed a global quadruple network by appending gene–lncRNA interaction, gene–disease association, and gene–miRNA interaction networks to the global tripartite network. Subsequently, based on these two global networks, we proposed a novel approach based on the naïve Bayesian classifier to predict potential lncRNA–disease associations (NBCLDA). Compared with state-of-the-art methods, our new method does not entirely rely on known lncRNA–disease associations, and can achieve reliable performance, as measured by the area under the ROC curve (AUC), in leave-one-out cross validation. Moreover, to further evaluate the performance of NBCLDA, case studies of colorectal cancer, prostate cancer, and glioma were implemented in this paper, and the simulation results demonstrated that NBCLDA can be an excellent tool for biomedical research in the future.
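The evaluation idea in the abstract (score candidate lncRNA–disease pairs from a network, then check by leave-one-out cross validation how highly each known association ranks) can be illustrated with a minimal sketch. This is not the NBCLDA model: the score below is a plain count of shared miRNA neighbours, swapped in for the naïve Bayesian classifier, and all identifiers (`lncA`, `miR1`, `d1`, `score`, `loocv_auc`) and the toy data are hypothetical and made up for illustration.

```python
from itertools import product

# Toy, made-up data (hypothetical identifiers, not taken from the paper).
lnc_mirna = {
    "lncA": {"miR1", "miR2"},
    "lncB": {"miR2", "miR3"},
    "lncC": {"miR4"},
}
mirna_disease = {
    "miR1": {"d1"},
    "miR2": {"d1", "d2"},
    "miR3": {"d2"},
    "miR4": {"d3"},
}
# Known lncRNA-disease associations, used here only as evaluation labels.
known = {("lncA", "d1"), ("lncB", "d2"), ("lncC", "d3")}
diseases = sorted({d for ds in mirna_disease.values() for d in ds})


def score(lnc, disease):
    """Count miRNAs linked to both the lncRNA and the disease.

    Assumption: a simple common-neighbour score standing in for the
    naive Bayesian probability described in the abstract.
    """
    return sum(1 for m in lnc_mirna[lnc] if disease in mirna_disease.get(m, set()))


def loocv_auc():
    """Leave-one-out AUC estimate.

    For each held-out known pair, compute the fraction of unlabeled
    candidate pairs that it outranks (a tie counts as 0.5), then average
    these fractions over all known pairs.
    """
    all_pairs = set(product(lnc_mirna, diseases))
    candidates = sorted(all_pairs - known)
    total = 0.0
    for held_out in sorted(known):
        s = score(*held_out)
        wins = 0.0
        for cand in candidates:
            c = score(*cand)
            if s > c:
                wins += 1.0
            elif s == c:
                wins += 0.5
        total += wins / len(candidates)
    return total / len(known)


if __name__ == "__main__":
    print(round(loocv_auc(), 3))
```

Because this toy score uses only miRNA links, holding out a known pair does not change any score. A score that also drew on known lncRNA–disease links would need to remove the held-out link before scoring each fold.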
genetics & heredity