Application of Hidden Semi-Markov Model to 3' Splice Sites Identification

冯秀程,钱敏平,邓明华,马小土,严熙婷
DOI: https://doi.org/10.3321/j.issn:1000-3282.2004.05.013
2004-01-01
PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS
Abstract:In order to improve exon level sensitivity and specificity of recent gene-finding programs, strong "search by signal" components are needed to identify splice sites, translation start and other biological signal sites. A new model for the identification of 3' splice sites (acceptors) using Hidden Semi-Markov Model (HSMM) was introduced. This model is proved to be particularly suitable for modeling the biological structure of acceptors. When tested in Burset/Guigo dataset,this new method demonstrated an improved accuracy compared with existing method. The success of this model gives a deep understanding of the structure of acceptors and the biological process of splicing.
What problem does this paper attempt to address?