Prediction of membrane protein types in a hybrid space.

Peilin Jia, Ziliang Qian, Kaiyan Feng, Wencong Lu, Yixue Li, Yudong Cai
DOI: https://doi.org/10.1021/pr700715c
2008-01-01
Journal of Proteome Research
Abstract: Predicting the types of membrane proteins is of great importance both for genome-wide annotation and for helping experimental researchers understand protein function. We describe a new strategy for predicting membrane protein types using the Nearest Neighbor Algorithm. We introduce a bipartite feature space consisting of two disjoint kinds of vectors: a protein's domain profile and its physicochemical properties. A jackknife cross-validation test shows that combining both kinds of features greatly improves prediction accuracy. Furthermore, the contribution of the physicochemical features to the classification of membrane proteins is explored using the feature selection method called "mRMR" (Minimum Redundancy, Maximum Relevance) (IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27 (8), 1226-1238). This yields a more compact set of the features that contribute most to membrane protein classification. The analyses highlight hydrophobicity and polarity as the most important features. The predictor using the 56 most contributive features achieves an acceptable prediction accuracy of 87.02%. An online prediction service is freely available on our Web site: http://pcal.biosino.org/TransmembraneProteinClassification.html.
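The approach summarized in the abstract (a nearest-neighbor classifier over a hybrid feature vector, scored by a jackknife test) can be illustrated with a minimal sketch. This is not the authors' implementation. The cosine-based distance, the function names, the toy feature values, and the toy class labels are all assumptions made for illustration; the abstract does not state which distance measure or encoding the paper uses.

```python
import math

def cosine_distance(a, b):
    """Distance as 1 - cosine similarity (an assumed choice of metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # treat an all-zero vector as maximally distant
    return 1.0 - dot / (norm_a * norm_b)

def hybrid_vector(domain_profile, physchem):
    """Concatenate the two disjoint feature blocks into one hybrid vector."""
    return list(domain_profile) + list(physchem)

def nearest_neighbor_label(query, vectors, labels):
    """Return the label of the training vector closest to the query."""
    best = min(range(len(vectors)),
               key=lambda i: cosine_distance(query, vectors[i]))
    return labels[best]

def jackknife_accuracy(vectors, labels):
    """Leave each sample out in turn and predict it from all the others."""
    correct = 0
    for i in range(len(vectors)):
        rest_vectors = vectors[:i] + vectors[i + 1:]
        rest_labels = labels[:i] + labels[i + 1:]
        if nearest_neighbor_label(vectors[i], rest_vectors, rest_labels) == labels[i]:
            correct += 1
    return correct / len(vectors)

# Toy data (hypothetical): (domain-profile bits, physicochemical scores, label)
samples = [
    ([1, 0, 0], [0.9, 0.1], "type I"),
    ([1, 0, 0], [0.8, 0.2], "type I"),
    ([0, 1, 0], [0.1, 0.9], "multipass"),
    ([0, 1, 1], [0.2, 0.8], "multipass"),
]
vectors = [hybrid_vector(domains, physchem) for domains, physchem, _ in samples]
labels = [label for _, _, label in samples]

accuracy = jackknife_accuracy(vectors, labels)
print(f"jackknife accuracy: {accuracy:.2%}")
```

In this sketch the two feature blocks are simply concatenated, so their relative scaling affects which neighbor is nearest; a real predictor would need to decide how to weight or normalize the domain and physicochemical parts.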