Differentiation Between Two-State and Multi-State Folding Proteins Based on Sequence

Ji-Tao Huang,Jin-Pei Cheng
DOI: https://doi.org/10.1002/prot.21893
2008-01-01
Proteins Structure Function and Bioinformatics
Abstract:Prediction of protein-folding rates follows different rules in two-state and multi-state kinetics. The prerequisite for the prediction is to recognize the folding kinetic pathway of proteins. Here, we use the logistic regression and support vector machine to discriminate between two-state and multi-state folding proteins. We find that chain length is sufficient to accurately recognize multi-state proteins. There is a transition boundary between two kinetic models. Protein folds with multi-state kinetics, if its length is larger than 112 residues. The logistic prediction from amino acid composition shows that the kinetic pathway of folding is closely related to amino acid volume. Small amino acids make two-state folding easier, and vice versa. However, cysteine, alanines arginine lysine histidine, and methionine do not conform to this rule.
What problem does this paper attempt to address?