Predicting subcellular localization of proteins using support vector machine with N-terminal amino composition

Yan-fu Li, Juan Liu
DOI: https://doi.org/10.1007/11527503_73
2005-01-01
Abstract: Prediction of protein subcellular localization is an active research topic in bioinformatics. In this paper, several support vector machines (SVMs) using a newly presented coding scheme based on N-terminal amino acid compositions are first trained to discriminate between proteins destined for the mitochondrion, the chloroplast, the secretory pathway, and 'other' localizations. A decision unit then makes the final prediction from the outputs of these SVMs. Tested on redundancy-reduced sets, the proposed method reached total accuracies of 89.6% (plant) and 91.9% (non-plant), which, to the best of our knowledge, are the highest reported on these data sets.
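The two stages named in the abstract, an N-terminal composition encoding followed by a decision unit over several SVM outputs, can be illustrated with a minimal sketch. The abstract does not specify the coding scheme or the decision rule, so everything concrete here is an assumption: the 40-residue window, the plain 20-dimensional residue-fraction vector, the "largest score wins" rule, the class labels, and the function names `n_terminal_composition` and `decide` are all hypothetical. SVM training itself is omitted; the scores passed to `decide` stand in for the outputs of separately trained SVMs.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def n_terminal_composition(sequence, window=40):
    """Return the fraction of each standard residue in the first `window` positions.

    The window length is an illustrative assumption, not a value from the paper.
    Non-standard symbols (e.g. 'X') are ignored when computing fractions.
    """
    segment = sequence[:window].upper()
    counts = Counter(segment)
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


def decide(scores):
    """Hypothetical decision unit: pick the class whose SVM gave the largest score."""
    return max(scores, key=scores.get)


# Usage: encode a made-up sequence, then combine made-up per-class SVM scores.
features = n_terminal_composition("MMAA", window=4)
label = decide({"mitochondrion": 0.2, "chloroplast": -0.1,
                "secretory": 1.3, "other": 0.0})
```

In practice each feature vector would be passed to the trained SVMs, and the decision unit might weigh their outputs differently than a simple maximum.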