Predicting the Splice Sites of Human Genome Based on Position-correlation Weight Matrix and DNA Structural Parameters

张鹏飞,李前忠,左永春,李涛
DOI: https://doi.org/10.3969/j.issn.1000-1638.2010.04.007
2010-01-01
Abstract: Recognizing human splice sites is an important problem. A DNA geometric descriptor and a position-correlation weight matrix (PCWM) are introduced to describe the conserved segments around splice sites. Support vector machine (SVM) models that combine the PCWM scoring function with DNA structural features are developed and used to predict the donor and acceptor splice sites of the human genome. In five-fold cross-validation, the total prediction accuracies are 92.55% and 90.70% for donors and acceptors, respectively. For a 3-way data split, the total accuracies are 92.25% and 89.87% for donors and acceptors, respectively.
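The abstract does not spell out the PCWM scoring function, so the following is only a minimal sketch of the simpler idea it builds on: a plain position weight matrix (log-odds against a uniform background) used to score a candidate window. It omits the position-correlation terms, the DNA structural features, and the SVM step. The function names `build_pwm` and `score`, the pseudocount, and the toy sequences are all illustrative assumptions, not taken from the paper.

```python
import math

BASES = "ACGT"

def build_pwm(sequences, pseudocount=1.0):
    """Build a per-position log-odds matrix from aligned, equal-length
    sequences, assuming a uniform background frequency of 0.25 per base."""
    length = len(sequences[0])
    pwm = []
    for pos in range(length):
        # Start every base at the pseudocount so unseen bases avoid log(0).
        counts = {base: pseudocount for base in BASES}
        for seq in sequences:
            counts[seq[pos]] += 1
        total = sum(counts.values())
        pwm.append({base: math.log2((counts[base] / total) / 0.25)
                    for base in BASES})
    return pwm

def score(pwm, window):
    """Sum the log-odds of the observed base at each position of the window."""
    return sum(column[base] for column, base in zip(pwm, window))

# Toy, made-up donor-like windows containing the GT dinucleotide.
donors = ["AGGTAAGT", "AGGTGAGT", "CGGTAAGT", "AGGTAAGA"]
pwm = build_pwm(donors)

site_score = score(pwm, "AGGTAAGT")        # resembles the training windows
background_score = score(pwm, "CCTTCCAA")  # does not resemble them
```

In a pipeline like the one the abstract describes, a score of this kind would presumably be one input feature among several passed to the classifier, rather than a decision rule on its own.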