Accurately predicting transcription start sites using logitlinear model and local oligonucleotide frequencies

Jia Wang,Chuang Ma,Dao Zhou,Libin Zhang,Yanhong Zhou
DOI: https://doi.org/10.1007/978-3-642-24553-4_16
2011-01-01
Abstract:In this study, we construct a transcription start site (TSS) prediction model using the logitlinear model and the genomic context features mined in promoter regions. We also develop a computational program named ProKey that is able to accurately predict TSSs in long DNA sequences. Performance evaluation results on the whole human genome show that ProKey could achieve 71.2% sensitivity and 76.3% specificity at the resolution level of 2000bp. Further comparison results exhibit that the correlation coefficient (CC) value of ProKey is higher than that of DragonGSF and Eponine.
What problem does this paper attempt to address?