Incorporating Cellular Sorting Structure for Better Prediction of Protein Subcellular Locations

Wen-Yun Yang,Bao-Liang Lu,James T. Kwok
DOI: https://doi.org/10.1080/0952813x.2010.506303
2011-01-01
Abstract:This article explores the interdependences between subcellular locations and incorporates them with support vector machines for prediction of protein subcellular localisation. Traditional prediction systems utilise a 'flat' structure of classifiers, such as the one-versus-all and one-versus-one schemes, with amino acid compositions to perform the prediction. Apart from those existing studies that ignore the interdependences between subcellular locations, we take advantage of a hierarchical structure to organise the subcellular locations and model their relationships. Here, we propose to use four kinds of hierarchical prediction methods and make comparative studies on three datasets. Experimental results show that three of the hierarchical models outperform the traditional 'flat' model in terms of tree loss values. In particular, one hierarchical model outperforms the traditional 'flat' model for all evaluation measures. Moreover, we gained some valuable insights into the sorting process by using hierarchical structures.
What problem does this paper attempt to address?