Splicing-site Recognition of Rice ( Oryza Sativa L.) DNA Sequences by Support Vector Machines

Peng Si-hua,Fan Long-jiang,Peng Xiao-ning,Zhuang Shu-lin,Du Wei,Chen Liang-biao
DOI: https://doi.org/10.1631/jzus.2003.0573
2003-01-01
Journal of Zhejiang University SCIENCE A
Abstract:MOTIVATION:It was found that high accuracy splicing-site recognition of rice (Oryza sativa L.) DNA sequence is especially difficult. We described a new method for the splicing-site recognition of rice DNA sequences.METHOD:Based on the intron in eukaryotic organisms conforming to the principle of GT-AG, we used support vector machines (SVM) to predict the splicing sites. By machine learning, we built a model and used it to test the effect of the test data set of true and pseudo splicing sites.RESULTS:The prediction accuracy we obtained was 87.53% at the true 5' end splicing site and 87.37% at the true 3' end splicing sites. The results suggested that the SVM approach could achieve higher accuracy than the previous approaches.
What problem does this paper attempt to address?