Predicting Multisite Protein Sub-Cellular Locations Based On Correlation Coefficient

Peng Wu,Dong Wang,Xiao-Fang Zhong,Qing Zhao
DOI: https://doi.org/10.1007/978-3-319-63312-1_67
2017-01-01
Abstract:With the development of proteomics and cell biology, protein sub-cellular location has become a hot topic in bioinformatics. As the time goes on, more and more researchers make great efforts on studying protein sub-cellular location. But they only do research on single-site protein sub-cellular location. However, some proteins can belong to two or more sub-cellulars. So, we should transfer the line of sight to multisite protein sub-cellular location. In this article, we use Virus-mPLoc data set and choose pseudo amino acid composition and correlation coefficient two effective feature extraction methods. Then, putting these features into multi-label k-nearest neighbor classifier to predict protein sub-cellular location. The experiment proves that this method is reasonable and the precision reached 68.65% through the Jack-knife test.
What problem does this paper attempt to address?