Sub-Loc: Predicting Protein Sub-Mitochondrial Localization Based on Sequence Embedding.

Xiaoting Wang, Juan Wang, Haodong Bian, Maozu Guo
DOI: https://doi.org/10.1109/bibm55620.2022.9995316
2022-01-01
Abstract: Mitochondria are subcellular organelles found in most eukaryotic organisms, and they play a pivotal role in many biochemical processes in cells. Proteins in different mitochondrial compartments have their own transport routes. Locating proteins within mitochondria can provide a solid foundation for the study of mitochondrial pathologies. Several computational methods have been proposed for this problem, but their accuracy is too low to identify the localization precisely. We develop an unsupervised learning model that represents mitochondrial proteins as n-dimensional vectors, called sequence embedding, which can learn global and contextual information from mitochondrial proteins. We design a new model, called Sub-Loc, that predicts the sub-mitochondrial localization of proteins using an SVM classifier together with the sequence embedding method. Both the sequence embedding method and Sub-Loc are evaluated experimentally. The results show that the sequence embedding method remarkably improves the prediction of sub-mitochondrial localization compared with other feature representations, and that Sub-Loc outperforms other approaches to this task.
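To make the idea of representing a protein as an n-dimensional vector concrete, here is a minimal sketch in Python. It is not the paper's method: the abstract does not describe how the embedding is learned, so the sketch swaps in a fixed, non-learned k-mer count vector as a stand-in. The names `kmers` and `embed`, the choice k = 2, and the toy sequence are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def kmers(sequence, k=2):
    """Split a protein sequence into overlapping k-mers ("words")."""
    sequence = sequence.upper()
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def embed(sequence, k=2):
    """Map a sequence to a fixed-length, L2-normalized k-mer count vector.

    Assumption: this is a non-learned stand-in. The paper's embedding is
    learned without labels, and its details are not given in the abstract.
    """
    # One dimension per possible k-mer over the standard alphabet (20**k).
    vocabulary = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    counts = Counter(kmers(sequence, k))
    # k-mers containing non-standard residues are simply ignored.
    vector = [float(counts[word]) for word in vocabulary]
    norm = sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0 else vector


# Hypothetical toy sequence, not taken from the paper's dataset.
vec = embed("MLSLRQSIRFFK")
print(len(vec))  # 400 dimensions for k=2
```

Vectors of this kind, one per protein, could then be passed with compartment labels to an SVM implementation (for example scikit-learn's `SVC`) to mirror the classifier stage the abstract mentions.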