Computational Prediction of Transcription Factor Binding Sites Based on HMM Model and Information Content

Xiaobao SU -,Lifang LIU -
DOI: https://doi.org/10.4156/jdcta.vol5.issue10.18
2011-01-01
International Journal of Digital Content Technology and its Applications
Abstract:The prediction of transcription factor binding sites (TFBSs ) is an essential task in the research of transcription regulation. The commonly used zero-order position specific scoring matrices (PSSMs) is limited in specificity. We thus make some improvements for the existing method. In this paper, a position-specific scoring matrix based on one order HMM according to the known motif sequence is proposed, predictions is presented base on HMM matrices, and the positional information content is used to optimize the likelihood score , then the potential results is filtered. In order to speed up the searching process, ESAsearch, a non-heuristic search algorithm, is used to efficiently find matches of PSSMs in large databases. The results of leave-one-out tests on Saccharomyces cerevisiae MCB,GATA,ROX1, UASPHR binding sites and the prediction results of Rattus norvegicus and Mus Musculus’binding sites show that the algorithm can predict the TFBS efficiently. Comparing with the PSSM based algorithm, our algorithm greatly decreases the number of false positives and improves the specificity.
What problem does this paper attempt to address?