Prelnc2: A prediction tool for lncRNAs with enhanced multi-level features of RNAs

Hua Gao,Peng Gao,Ning Ye
DOI: https://doi.org/10.1371/journal.pone.0286377
IF: 3.7
2023-06-02
PLoS ONE
Abstract:Long non-coding RNAs (lncRNAs) have been widely studied for their important biological significance. In general, we need to distinguish them from protein coding RNAs (pcRNAs) with similar functions. Based on various strategies, algorithms and tools have been designed and developed to train and validate such classification capabilities. However, many of them lack certain scalability, versatility, and rely heavily on genome annotation. In this paper, we design a convenient and biologically meaningful classification tool "Prelnc2" using multi-scale position and frequency information of wavelet transform spectrum and generalizes the frequency statistics method. Finally, we used the extracted features and auxiliary features together to train the model and verify it with test data. PreLnc2 achieved 93.2% accuracy for animal and plant transcripts, outperforming PreLnc by 2.1% improvement and our method provides an effective alternative to the prediction of lncRNAs.
multidisciplinary sciences
What problem does this paper attempt to address?