Better Prediction of the Location of Alpha-Turns in Proteins with Support Vector Machine

Yan Wang,Zhidong Xue,Jin Xu
DOI: https://doi.org/10.1002/prot.21062
2006-01-01
Proteins Structure Function and Bioinformatics
Abstract:We have developed a novel method named AlphaTurn to predict a-turns in proteins based on the support vector machine (SVM). The prediction was done on a data set of 469 nonhomologous proteins containing 967 a-turns. A great improvement in prediction performance was achieved by using multiple sequence alignment generated by PSI-BLAST as input instead of the single amino acid sequence. The introduction of secondary structure information predicted by PSIPRED also improved the prediction performance. Moreover, we handled the very uneven data set by combining the cost factor j with the "state-shifting" rule. This further promoted the prediction quality of our method. The final SVM model yielded a Matthews correlation coefficient (MCC) of 0.25 by a 10-fold cross-validation. To our knowledge, this MCC value is the highest obtained so far for predicting a-turns. An online Web server based on this method has been developed and can be freely accessed at httpJ/bmc.hust.edu.cn/ bioinformatics/ or http://210.42.106.80/.
What problem does this paper attempt to address?