Predicting Promoters by Pseudo-Trinucleotide Compositions Based on Discrete Wavelets Transform.

Xuan Zhou,Zhanchao Li,Zong Dai,Xiaoyong Zou
DOI: https://doi.org/10.1016/j.jtbi.2012.11.024
IF: 2.405
2012-01-01
Journal of Theoretical Biology
Abstract:In this paper, the discrete wavelet transform was introduced into the trinucleotide compositions and a novel DNA sequence representation technique, namely pseudo-trinucleotide compositions was proposed. The pseudo-trinucleotide compositions based on discrete wavelets transform were then employed to model support vector machines (SVM) for the prediction of promoters. The model was evaluated on the genie dataset, and the overall prediction accuracy (ACC) by jackknife validation for the classification of promoters, introns and exons was 82.46%, while the ACC for the classification of promoters and unpromoters was 82.18%, which was far better than the previous results. The satisfied prediction result revealed that the pseudo-trinucleotide composition based on discrete wavelet transform was an effective representation method for DNA sequence, and plays a very important role in the prediction of DNA function.
What problem does this paper attempt to address?