Encoding Based on Grouped Weight for Protein Sequence and Its Application to Structural Class Prediction

Yongxian Wang
2007-01-01
Computer Engineering and Applications Journal
Abstract:Based on the idea of coarse-grained description in physics,a new encoding method with grouped weight for protein sequence is presented and applied to protein structural class prediction associated with component-coupled algorithm.The average rate of correct recognition is 99.72% in Resubstitution test and 91.09% in Jack-knife test for standard set of 359 proteins.For the same training dataset and the same predictive algorithm,the overall predictive accuracy of our method for the Jack-knife test is 7% higher than the accuracy based only on the amino-acid composition,especially for the class of α+β is 15% higher than that for amino-acid composition method.The experiment results show that the encoding method is efficient to extract the structure information implicated in protein sequence.
What problem does this paper attempt to address?