An Effective Computational Method For Human Splice Sites Identification

Jiuqiang Han, Ying Cui, Jun Liu, Xinman Zhang
DOI: https://doi.org/10.1109/ASCC.2013.6606395
2013-01-01
Abstract: Owing to the vast amount of DNA sequence data, predicting the complete structure of genes from genomic DNA sequence has become an important problem. In eukaryotes, and in the human genome in particular, splice site identification plays a crucial role in gene structure prediction. A hybrid feature extraction approach that combines the position weight matrix (PWM) with the increment of diversity (ID) was proposed. Based on the extracted features, a support vector machine (SVM) was applied to classify authentic and false splice sites. The new algorithm was shown to be effective and simple: it correctly classified 92.98% of donor sites and 90.46% of acceptor sites. The method is therefore expected to be promising for identifying splice sites in the human genome.
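The hybrid feature extraction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not give the formulas, so the sketch assumes the commonly used definitions: a log-odds PWM score against a uniform background, and the increment of diversity ID(X, Y) = D(X + Y) - D(X) - D(Y), with D(X) = N log N - sum(n_i log n_i) computed over k-mer counts. The function names, the choice of dinucleotides (k = 2), the pseudocount, and the toy sequences are all hypothetical. The resulting feature vector is what would then be passed to an SVM classifier, a step omitted here.

```python
import math
from collections import Counter
from itertools import product

ALPHABET = "ACGT"

def build_pwm(seqs, pseudocount=1.0):
    """Per-position base frequencies from aligned, equal-length sequences."""
    total = len(seqs) + pseudocount * len(ALPHABET)
    pwm = []
    for pos in range(len(seqs[0])):
        counts = Counter(s[pos] for s in seqs)
        pwm.append({b: (counts[b] + pseudocount) / total for b in ALPHABET})
    return pwm

def pwm_score(seq, pwm, background=0.25):
    """Log-odds score of seq under the PWM (uniform background assumed)."""
    return sum(math.log2(pwm[i][b] / background) for i, b in enumerate(seq))

def kmer_counts(seq, k=2):
    """Counts of every possible k-mer in seq, in a fixed order."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(p)] for p in product(ALPHABET, repeat=k)]

def pooled_kmer_counts(seqs, k=2):
    """Sum the k-mer count vectors of a set of sequences."""
    pooled = [0] * len(ALPHABET) ** k
    for s in seqs:
        pooled = [a + b for a, b in zip(pooled, kmer_counts(s, k))]
    return pooled

def diversity(counts):
    """D(X) = N log N - sum(n_i log n_i), skipping zero counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return total * math.log(total) - sum(c * math.log(c) for c in counts if c > 0)

def increment_of_diversity(x, y):
    """ID(X, Y) = D(X + Y) - D(X) - D(Y); smaller means more similar."""
    merged = [a + b for a, b in zip(x, y)]
    return diversity(merged) - diversity(x) - diversity(y)

def extract_features(seq, pwm, true_ref, false_ref, k=2):
    """Hybrid feature vector: PWM score plus ID against each reference set."""
    x = kmer_counts(seq, k)
    return [
        pwm_score(seq, pwm),
        increment_of_diversity(x, true_ref),
        increment_of_diversity(x, false_ref),
    ]

# Toy, made-up sequences for illustration only (not data from the paper).
true_sites = ["AAGGTAAGT", "CAGGTGAGT", "AAGGTAAGA", "CAGGTAAGG"]
false_sites = ["ACCTTGCAA", "TTGCACCGA", "GGCATTCAC", "CTATCGGAT"]

pwm = build_pwm(true_sites)
true_ref = pooled_kmer_counts(true_sites)
false_ref = pooled_kmer_counts(false_sites)

candidate = "CAGGTAAGT"
features = extract_features(candidate, pwm, true_ref, false_ref)
print(features)
```

In a full pipeline, one such vector per candidate site, labelled authentic or false, would form the training set for the SVM. Separate models for donor and acceptor sites would be a natural design, since the abstract reports their accuracies separately.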