Phosphorylation Site Prediction Based on k-Nearest Neighbor Algorithm and BLOSUM62 Matrix

WANG Ming-Hui, WANG Li-Rong, XU Wen-Long, LIN Xiao-Jun, JIANG Zhao-Hui, FENG Huan-Qing
DOI: https://doi.org/10.3969/j.issn.0258-8021.2007.03.015
2007-01-01
Abstract: Phosphorylation is one of the most important post-translational modifications of eukaryotic proteins. Experimental identification of the substrates of protein kinases (PKs), together with their phosphorylation sites, is time-consuming and often restricted by the availability of enzymatic reactions. Machine-learning prediction of phosphorylation sites and their specific kinases from primary sequence is therefore highly desirable: such methods provide fast, automatic annotations that can guide further experimental work. In this paper, we present a modified k-Nearest Neighbor (k-NN) method that uses the Euclidean distance for phosphorylation site prediction. BLOSUM62-based similarity scores are adopted as the input vectors. Prediction results on several PK groups show that, in general, the method outperforms the state-of-the-art methods Scansite, KinasePhos, and NetPhosK, which suggests that it is a competitive computational approach in this branch of bioinformatics. The method has the advantages of simplicity, efficiency, and robustness.
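The general idea of the abstract (substitution-matrix scores as input vectors, Euclidean distance, nearest-neighbor voting) can be illustrated with a minimal sketch. Everything specific here is an assumption, not the authors' method: the abstract does not say how the k-NN is modified or how the vectors are built, so this is plain majority-vote k-NN, with each residue encoded as its row of the matrix. `TOY_MATRIX` is a four-residue stand-in for the full 20x20 BLOSUM62 table, and the peptides and labels in `TRAINING` are hypothetical.

```python
import math
from collections import Counter

# Four-residue toy table (columns ordered A, R, S, T) standing in for the
# full 20x20 BLOSUM62 matrix; a real run would load the complete matrix.
TOY_MATRIX = {
    "A": [4, -1, 1, 0],
    "R": [-1, 5, -1, -1],
    "S": [1, -1, 4, 1],
    "T": [0, -1, 1, 5],
}

def encode(peptide, matrix=TOY_MATRIX):
    """Concatenate the matrix row of every residue into one flat vector."""
    vector = []
    for residue in peptide:
        vector.extend(matrix[residue])
    return vector

def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def knn_predict(query, training, k=3, matrix=TOY_MATRIX):
    """Majority vote among the k training peptides closest to the query."""
    q = encode(query, matrix)
    ranked = sorted(training, key=lambda item: euclidean(q, encode(item[0], matrix)))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

# Hypothetical 5-residue windows centered on a candidate S/T site:
# label 1 = phosphorylated by the kinase of interest, 0 = not.
TRAINING = [
    ("RRSAA", 1), ("RRSAT", 1), ("RRTAA", 1),
    ("AASAA", 0), ("TASAT", 0), ("AATTA", 0),
]

if __name__ == "__main__":
    for window in ("RRSAA", "AASAT"):
        print(window, knn_predict(window, TRAINING, k=3))
```

With the full matrix, each residue would contribute 20 scores, so a window of n residues maps to a 20n-dimensional vector; the window length and the value of k are tuning choices that the abstract does not specify.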