Residue-couple Model for Protein Subcellular Localization Prediction

Jian Guo, Zhirong Sun
2003-01-01
Abstract: Subcellular localization is a key functional characteristic of a protein and plays an important role in genome analysis. An automatic, reliable, and efficient prediction system for protein subcellular localization is therefore needed for large-scale genome analysis. In this paper, we construct a new model, the residue-couple model, and apply a support vector machine within this framework to predict subcellular localization. In addition to the traditional amino acid composition, the residue-couple model incorporates the effect of sequence order. Under 5-fold cross-validation, the total prediction accuracy reached 92.0% for prokaryotic protein sequences and 86.9% for eukaryotic protein sequences, a significant improvement over previous methods. We also show that our model is robust to errors in the N-terminal region of sequences.
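
The abstract does not spell out how the residue-couple features are computed, so the following is only an illustrative sketch of one plausible reading: the amino acid composition (20 values) is extended with the normalized frequencies of residue pairs separated by a fixed gap (400 values per gap), which is one simple way to capture sequence order. The function names, the `max_gap` parameter, and the normalization are assumptions made for this example, not the authors' definitions.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered residue pairs, e.g. "AA", "AC", ..., "YY".
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def composition(seq):
    """Fraction of each standard amino acid in the sequence (20 values)."""
    seq = seq.upper()
    n = sum(1 for r in seq if r in AMINO_ACIDS)
    return [seq.count(a) / n if n else 0.0 for a in AMINO_ACIDS]


def couple_frequencies(seq, gap):
    """Normalized counts of residue pairs (seq[i], seq[i + gap]) (400 values).

    Assumption: pairs containing non-standard residues are skipped.
    """
    seq = seq.upper()
    counts = dict.fromkeys(PAIRS, 0)
    total = 0
    for i in range(len(seq) - gap):
        pair = seq[i] + seq[i + gap]
        if pair in counts:
            counts[pair] += 1
            total += 1
    return [counts[p] / total if total else 0.0 for p in PAIRS]


def residue_couple_features(seq, max_gap=2):
    """Hypothetical feature vector: composition plus pair frequencies
    for gaps 1..max_gap, giving 20 + 400 * max_gap values."""
    features = composition(seq)
    for gap in range(1, max_gap + 1):
        features.extend(couple_frequencies(seq, gap))
    return features
```

A vector built this way could then be passed to any support vector machine implementation as the input representation of a sequence; the kernel and parameters used in the paper are not given in the abstract.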