Prediction of protein structure class by coupling improved genetic algorithm and support vector machine

Z.-C. Li,X.-B. Zhou,Y.-R. Lin,X.-Y. Zou
DOI: https://doi.org/10.1007/s00726-008-0084-z
IF: 3.7891
2008-01-01
Amino Acids
Abstract:Structural class characterizes the overall folding type of a protein or its domain. Most of the existing methods for determining the structural class of a protein are based on a group of features that only possesses a kind of discriminative information for the prediction of protein structure class. However, different types of discriminative information associated with primary sequence have been completely missed, which undoubtedly has reduced the success rate of prediction. We present a novel method for the prediction of protein structure class by coupling the improved genetic algorithm (GA) with the support vector machine (SVM). This improved GA was applied to the selection of an optimized feature subset and the optimization of SVM parameters. Jackknife tests on the working datasets indicated that the prediction accuracies for the different classes were in the range of 97.8–100% with an overall accuracy of 99.5%. The results indicate that the approach has a high potential to become a useful tool in bioinformatics.
What problem does this paper attempt to address?