Abstract: Proteins of known structure are usually classified into four structural classes: all-alpha, all-beta, alpha+beta, and alpha/beta. A number of methods for predicting the structural class of a protein from its amino acid composition have been developed during the past few years. Recently, a component-coupled method was developed for predicting protein structural class according to amino acid composition. This method is based on the least Mahalanobis distance principle and yields much better predictions than the previous methods. However, the success rates reported for structural class prediction by different investigators are contradictory: the highest accuracies reported for this method are near 100%, but the lowest is only about 60%. The goal of this study is to resolve this paradox and to determine the possible upper limit of the prediction rate for structural classes. In this paper, based on the normality assumption and the Bayes decision rule for minimum error, a new method is proposed for predicting the structural class of a protein according to its amino acid composition. A detailed theoretical analysis indicates that if the four protein folding classes are governed by normal distributions, the present method will yield the optimum predictive result in a statistical sense. A non-redundant data set of 1,189 protein domains is used to evaluate the performance of the new method. Our results demonstrate that 60% correctness is the upper limit for a four-class prediction from amino acid composition alone for an unknown query protein. The apparently high accuracy (more than 90%) attained in previous studies was due to the preselection of test sets, which may not be adequately representative of all unrelated proteins.
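The decision rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each class is modelled by a multivariate normal distribution whose mean, covariance, and prior are estimated from training data, and it assigns a query to the class with the largest log posterior. The function names (`composition`, `fit_class_gaussians`, `bayes_discriminant`, `predict`) are hypothetical. Dropping the last composition component is also an assumption made here, so that the covariance matrix is not singular (the 20 fractions always sum to 1).

```python
import numpy as np

def composition(seq, alphabet="ACDEFGHIKLMNPQRSTVWY"):
    """Amino acid composition as fractions; the last component is dropped
    (assumption) because the 20 fractions sum to 1 and are linearly dependent."""
    counts = np.array([seq.count(a) for a in alphabet], dtype=float)
    return (counts / counts.sum())[:-1]

def fit_class_gaussians(X, y):
    """Estimate (mean, covariance, prior) for every class label in y."""
    params = {}
    for label in np.unique(y):
        Xk = X[y == label]
        params[label] = (
            Xk.mean(axis=0),
            np.cov(Xk, rowvar=False),
            len(Xk) / len(X),
        )
    return params

def bayes_discriminant(x, mean, cov, prior):
    """Log posterior of a class, up to a constant shared by all classes."""
    diff = x - mean
    maha = diff @ np.linalg.solve(cov, diff)   # squared Mahalanobis distance
    _, logdet = np.linalg.slogdet(cov)         # log |covariance|
    return -0.5 * maha - 0.5 * logdet + np.log(prior)

def predict(x, params):
    """Bayes decision rule for minimum error: pick the largest discriminant."""
    return max(params, key=lambda k: bayes_discriminant(x, *params[k]))
```

In this sketch, if all classes had equal priors and covariance matrices with equal determinants, the last two terms of the discriminant would be the same for every class, and the rule would reduce to choosing the class with the least Mahalanobis distance. With real composition data, a class needs more training samples than features for its covariance estimate to be invertible.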

Using pseudo amino acid composition to predict protein structural class: approached by incorporating 400 dipeptide components.

Using pseudo amino acid composition to predict protein structural classes: Approached with complexity measure factor

An Optimization Approach to Predicting Protein Structural Class from Amino Acid Composition

Using Pseudo-Amino Acid Composition and Support Vector Machine to Predict Protein Structural Class.

Prediction of Functional Class of Proteins and Peptides Irrespective of Sequence Homology by Support Vector Machines.

How Good is Prediction of Protein Structural Class by the Component-Coupled Method?

A Correlation-Coefficient Method to Predicting Protein-Structural Classes from Amino-Acid Compositions

A Weighting Method for Predicting Protein Structural Class from Amino-Acid Composition

Accurate Prediction of Protein Structural Classes by Incorporating Predicted Secondary Structure Information into the General Form of Chou's Pseudo Amino Acid Composition.

Using grey dynamic modeling and pseudo amino acid composition to predict protein structural classes

Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition.

Prediction of Protein (domain) Structural Classes Based on Amino-Acid Index.

Predicting Protein Structural Class with Pseudo-Amino Acid Composition and Support Vector Machine Fusion Network.

Prediction of Protein Structural Class Using PSI-BLAST Profile Based Collocation of Amino Acid Pairs

Predicting Protein Structural Classes for Low-Similarity Sequences by Evaluating Different Features

The Prediction of the Structural Class of Protein: Application of the Measure of Diversity

Prediction of Seven Protein Structural Classes by Fusing Multi-Feature Information Including Protein Evolutionary Conservation Information

Prediction of Protein Structural Classes Based on Correlations of Amino Acid Residues

Prediction of protein structural class using novel evolutionary collocation-based sequence representation.

Prediction and Classification of Domain Structural Classes

Improving protein structural class prediction using novel combined sequence information and predicted secondary structural features