Prediction of Primate Splice Site Using Inhomogeneous Markov Chain and Neural Network.

Libin Liu,Yee-Kin Ho,Stephen Yau
DOI: https://doi.org/10.1089/dna.2007.0583
2007-01-01
DNA and Cell Biology
Abstract:The inhomogeneous Markov chain model is used to discriminate acceptor and donor sites in genomic DNA sequences. It outperforms statistical methods such as homogeneous Markov chain model, higher order Markov chain and interpolated Markov chain models, and machine-learning methods such as k-nearest neighbor and support vector machine as well. Besides its high accuracy, another advantage of inhomogeneous Markov chain model is its simplicity in computation. In the three states system (acceptor, donor, and neither), the inhomogeneous Markov chain model is combined with a three-layer feed forward neural network. Using this combined system 3175 primate splice-junction gene sequences have been tested, with a prediction accuracy of greater than 98%.
What problem does this paper attempt to address?