A Markov chain Monte Carlo sampling relevance vector machine model for recognizing transcription start sites

JunCai Huang,FengBi Wang,Huanzhang Mao,MingTian Zhou
DOI: https://doi.org/10.1109/AICI.2010.277
2010-01-01
Abstract:The task of finding transcription start sites (TSSs) can be modeled as a classification problem. Relevance vector machines (RVM) is a family of machine learning methods that represent a Bayesian approach to the training of general linear models (GLM).Based on the Markov-chain Monte Carlo(MCMC) sampler, propose a model for using the RVM to explore very large numbers of candidate features.The model applyes the power of the RVM to classifying and detecting interesting points and regions in biological sequence data. The model has been used successfully for testing predicting transcription start sites and other features in genome sequences. Our experimental results on real nucleotide sequences data show that our method improve the prediction accuracy greatly and our method performs significantly better thanPromoterInspector and CpG islands. © 2010 IEEE.
What problem does this paper attempt to address?