A Novel Stepwise Support Vector Machine (svm) Method Based on Optimal Feature Combination for Predicting Mirna Precursors

Limei Wang,Jin Li,Rongsheng Zhu,Liangde Xu,Ying He,Ruijie Zhang,Shaoqi Rao
DOI: https://doi.org/10.5897/ajb11.2273
2011-01-01
AFRICAN JOURNAL OF BIOTECHNOLOGY
Abstract:MicroRNAs (miRNAs) are a class of non-coding RNAs that are produced from miRNA precursors (pre-miRNAs) with stem-loop structure. At present, development of computational approach for pre-miRNA identification continues to be a challenging task, in which feature selection is greatly important. Here, we first extracted feature subsets by a hybrid algorithm of genetic algorithm (GA) and support vector machine (SVM) from 124 sequence and secondary structure features. Next, based on the high-frequency features taken from the feature subsets, we proposed a novel stepwise SVM method to identify the optimal feature combinations. The cooperative effect was found among different features in our study. Finally, we obtained 10 feature combinations with strong combined effect which possessed high classification performance for predicting pre-miRNAs. In external validation, all the 10 combinations could predict accurately over 13 pre-miRNAs from 16 new confirmed human pre-miRNAs in miRBase 14.0. The best one could reach 15 (93.75%), which significantly outperformed triplet-SVM (13, 81.25%) in predicting pre-miRNAs.
What problem does this paper attempt to address?