Prediction of Protein Domain Folding Classes

LI Kun,LI Qian-zhong
DOI: https://doi.org/10.3969/j.issn.1000-1638.2005.02.009
2005-01-01
Abstract:Based on the recent SCOP database, the numbers of #alpha#-helices, #beta#-sheets and #beta##alpha##beta# fragments of the 2616 protein domains were caculated by using their secondary structure sequences in Brookhaven protein Data Bank (PDB). The structural classes of a protein can be predicted by using those numbers as parameters and the method of increment of diversity. The results show that the high rates of correct prediction are obtained in spite of using three different standard sets and test sets. The overall average rates of correct prediction are over 99%, 92%, 89A% and 87% for All-#alpha#, All-#beta#, #alpha#/#beta# and #alpha#+#beta# classes of protein respectively. For a standard set, the overall average rate of correct prediction is 93.82%; and for test set, the overall average of correct prediction is 94.35%.
What problem does this paper attempt to address?