milRNApredictor: Genome-free prediction of fungi milRNAs by incorporating <i>k</i>-mer scheme and distance-dependent pair potential

Yuangen Yao,Huiyu Zhang,Haiyou Deng
DOI: https://doi.org/10.1016/j.ygeno.2019.12.019
IF: 4.31
2020-01-01
Genomics
Abstract:MicroRNA-like small RNAs (milRNAs) with length of 21-22 nucleotides are a type of small non-coding RNAs that are firstly found in Neurospora crassa in 2010. Identifying milRNAs of species without genomic information is a difficult problem. Here, knowledge-based energy features are developed to identify milRNAs by tactfully incorporating k-mer scheme and distance-dependent pair potential. Compared with k-mer scheme, features developed here can alleviate the inherent curse of dimensionally in k-scheme once k becomes large. In addition, milRNApredictor built on novel features performs comparably to k-mer scheme, and achieves sensitivity of 74.21%, and specificity of 75.72% based on 10-fold cross-validation. Furthermore, for novel miRNA prediction, there exists high overlap of results from milRNApredictor and state-of-the-art mirnovo. However, milRNApredictor is simpler to use with reduced requirements of input data and dependencies. Taken together, milRNApredictor can be used to de novo identify fungi milRNAs and other very short small RNAs of non-model organisms.
What problem does this paper attempt to address?