Identification of Microrna Precursors Based on Random Forest with Network-Level Representation Method of Stem-Loop Structure.
Jiamin Xiao,Xiaojing Tang,Yizhou Li,Zheng Fang,Daichuan Ma,Yangzhige He,Menglong Li
DOI: https://doi.org/10.1186/1471-2105-12-165
IF: 3.307
2011-01-01
BMC Bioinformatics
Abstract:BACKGROUND:MicroRNAs (miRNAs) play a key role in regulating various biological processes such as participating in the post-transcriptional pathway and affecting the stability and/or the translation of mRNA. Current methods have extracted feature information at different levels, among which the characteristic stem-loop structure makes the greatest contribution to the prediction of putative miRNA precursor (pre-miRNA). We find that none of these features alone is capable of identifying new pre-miRNA accurately.RESULTS:In the present work, a pre-miRNA stem-loop secondary structure is translated to a network, which provides a novel perspective for its structural analysis. Network parameters are used to construct prediction model, achieving an area under the receiver operating curves (AUC) value of 0.956. Moreover, by repeating the same method on two independent datasets, accuracies of 0.976 and 0.913 are achieved, respectively.CONCLUSIONS:Network parameters effectively characterize pre-miRNA secondary structure, which improves our prediction model in both prediction ability and computation efficiency. Additionally, as a complement to feature extraction methods in previous studies, these multifaceted features can reflect natural properties of miRNAs and be used for comprehensive and systematic analysis on miRNA.
What problem does this paper attempt to address?