Identification of the Protein Superfamilies by Using of the Least Diversity Increment Based on the Hydropathy Distribution of Amino Acids

LIU Fen,LI Qian-zhong
DOI: https://doi.org/10.3969/j.issn.1000-1638.2006.04.011
2006-01-01
Abstract:The usual methods for identifying the protein superfamilies are always to search for the common patterns of the proteins in the same superfamily,and the patterns have been considered as the core to decide the structure and function of proteins.The distributions of hydropathicity along the amino acid sequence are selected as the parameters of the sources of diversity.The four different superfamlies in the same structure class are predicted by the least increments of diversity within the four increments.The results show that the overall prediction accuracies of different superfamlies are 83.0% and 81.2% for all-α class,both 80.9% for all-β class,(88.6%) and 88.0% for α+β class,69.3% and 67.6% for α/β class by self-consistency test and jack-knife test,respectively.
What problem does this paper attempt to address?