Prediction of Golgi-resident Protein Types by Using Feature Selection Technique

Hui Ding,Shou-Hui Guo,En-Ze Deng,Lu-Feng Yuan,Feng-Biao Guo,Jian Huang,Nini Rao,Wei Chen,Hao Lin
DOI: https://doi.org/10.1016/j.chemolab.2013.03.005
IF: 4.175
2013-01-01
Chemometrics and Intelligent Laboratory Systems
Abstract:The functions of Golgi apparatus are to store, package and distribute proteins. Knowing the type of a Golgi-resident protein will provide in-depth insight into its function. In this study, we developed a support vector machine-based method to identify the types of Golgi-resident proteins by using only amino acid sequence information. A strictly and objective dataset including 137 proteins with the sequence identity <25% was used for training and testing the support vector machine. The analysis of variance was proposed to find out the optimized feature set. In the leave-one-out cross-validation, the maximum overall accuracy of 85.4% was achieved with the area under the receiver operating characteristic curves of 0.878. The results demonstrate that the proposed method can be used to discriminate the types of Golgi-resident proteins. An on-line server subGolgi is freely available at http://lin.uestc.edu.cn/server/subGolgi2.
What problem does this paper attempt to address?