Recognition of protein folding kinetics pathways based on amino acid properties information derived from primary sequence

Lili Xi, Shuyan Li, Yuhui Wei, Xin'an Wu, Huanxiang Liu, Xiaojun Yao
DOI: https://doi.org/10.1016/j.chemolab.2013.04.019
IF: 4.175
2013-01-01
Chemometrics and Intelligent Laboratory Systems
Abstract: Recognizing protein folding kinetics pathways is an effective approach to studying protein folding behavior, and thereby to better understanding the mechanism by which a protein folds into a functional structure. In this study, we present a novel method for classifying protein folding kinetics pathways based on a new class of features weighted by amino acid properties, which are derived from the protein primary sequence. According to the leave-one-out and bootstrap cross-validation results, the model with eight features was the best, achieving a satisfactory prediction accuracy of 91.67% on the training set; n-fold cross-validation was also performed, and the results showed that the model was stable. In addition, an external test set was used to evaluate the predictive ability of the model: the accuracy on the external test set was 88.24% and the Matthews correlation coefficient (MCC) was 0.79. The selected important features were then analyzed to gain a better understanding of protein folding mechanisms. The analysis suggests that long-range interactions and the unfolding Gibbs free energy change are important factors in determining protein folding kinetics pathways. Hydrophobicity, secondary structure, and charge also appear to be important properties affecting protein folding behavior.
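For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how accuracy and the Matthews correlation coefficient (MCC) are computed from a binary confusion matrix. The function names and the counts used are hypothetical: the abstract does not give the test set size or its confusion matrix, so the numbers below are illustrative only and were chosen so that the arithmetic lands on the same scale as the reported values.

```python
import math


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of samples classified correctly."""
    total = tp + tn + fp + fn
    return (tp + tn) / total


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient for a binary confusion matrix.

    Returns 0.0 when any marginal sum is zero, a common convention
    for the otherwise undefined case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts (NOT taken from the paper): 17 proteins, 15 correct.
tp, tn, fp, fn = 8, 7, 0, 2
print(f"accuracy = {accuracy(tp, tn, fp, fn):.4f}")
print(f"MCC      = {mcc(tp, tn, fp, fn):.2f}")
```

Unlike accuracy, MCC uses all four cells of the confusion matrix, so it stays informative when the two classes are unevenly represented; a value of 1 indicates perfect agreement and 0 indicates performance no better than chance.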