Long Non-Coding RNAs Function Annotation: a Global Prediction Method Based on Bi-Colored Networks.

Xingli Guo,Lin Gao,Qi Liao,Hui Xiao,Xiaoke Ma,Xiaofei Yang,Haitao Luo,Guoguang Zhao,Dechao Bu,Fei Jiao,Qixiang Shao,RunSheng Chen,Yi Zhao
DOI: https://doi.org/10.1093/nar/gks967
IF: 14.9
2012-01-01
Nucleic Acids Research
Abstract:More and more evidences demonstrate that the long non-coding RNAs (lncRNAs) play many key roles in diverse biological processes. There is a critical need to annotate the functions of increasing available lncRNAs. In this article, we try to apply a global network-based strategy to tackle this issue for the first time. We develop a bi-colored network based global function predictor, long non-coding RNA global function predictor ('lnc-GFP'), to predict probable functions for lncRNAs at large scale by integrating gene expression data and protein interaction data. The performance of lnc-GFP is evaluated on protein-coding and lncRNA genes. Cross-validation tests on protein-coding genes with known function annotations indicate that our method can achieve a precision up to 95%, with a suitable parameter setting. Among the 1713 lncRNAs in the bi-colored network, the 1625 (94.9%) lncRNAs in the maximum connected component are all functionally characterized. For the lncRNAs expressed in mouse embryo stem cells and neuronal cells, the inferred putative functions by our method highly match those in the known literature.
What problem does this paper attempt to address?