A Transverse and Longitudinal Encoding of Protein Sequence and Its Application

Zhongmei Guo,Sheng Yang,Qingming Hu,Lihong Peng
DOI: https://doi.org/10.1166/jctn.2013.2690
2013-01-01
Journal of Computational and Theoretical Nanoscience
Abstract:Basis on the intrinsic relationship between protein function and its subcellular location, further understanding the function of protein, and identifying the subcellular location becomes the important research area of cell biology and proteomics. In this paper, based on the amino acid composition, we propose a new feature extraction method, which contains Chou's amino acid composition, and also includes the position distribution information of the amino acid residues and Local order information in protein sequence. Then we use a classifiers, which is NN (the nearest neighbor classifier), to predict two standard sequence datasets, Both of these two methods achieve higher predictive success rates by the jackknife tests, Compared with existing models, the experiment result show that overall prediction effect on two datasets are improved.
What problem does this paper attempt to address?