Prediction of the E.coli-K12 Promoter Based on Multi-Feature Selection

Hao Lin
2010-01-01
Abstract:According to the known knowledge of 741 experimentally confirmed Sigma70 promoters,the promoters are predicted. At first,based on the interaction between RNAp and DNA elements,the position-correlation-score-function (PCSF) algorithm is used to measure the conservative sits in promoter sequences. Subsequently,according to the characteristics of promoters,a diversity index is applied to measure the information content in different regions. Finally,the modified Mahalanobis Discriminant is proposed to perform prediction. The overall accuracies of 10-fold cross-validation of 85%+ are achieved. By comparing with other methods,it is shown that the proposed method can recognize the Escherichia coli promoters with high accuracy.
What problem does this paper attempt to address?