Protein sub-cellular localisation prediction by analysis of short-range residue correlations.

Jian Guo,Yuanlie Lin,Zhirong Sun
DOI: https://doi.org/10.1504/IJBRA.2006.009762
2006-01-01
International Journal of Bioinformatics Research and Applications
Abstract:Sub-cellular localisation performs an important role in genome analysis. This paper describes a new residue-couple model using a support vector machine to predict the sub-cellular localisation of proteins. This new approach provides better predictions than the existing methods. The total prediction accuracies on Reinhardt and Hubbard's dataset reach 92.0% for prokaryotic protein sequences and 86.9% for eukaryotic protein sequences with fivefold cross validation. For a new dataset with 8304 proteins located in eight sub-cellular locations, the total accuracy achieves 88.9%. Meanwhile, the model shows robust against N-terminal errors in the sequences.
What problem does this paper attempt to address?