RF-Phos: A Novel General Phosphorylation Site Prediction Tool Based on Random Forest

Hamid D Ismail,Ahoi Jones,Jung H Kim,Robert H Newman,Dukka B Kc,Hamid D. Ismail,Jung H. Kim,Robert H. Newman,Dukka B. KC
DOI: https://doi.org/10.1155/2016/3281590
2016-01-01
BioMed Research International
Abstract:Protein phosphorylation is one of the most widespread regulatory mechanisms in eukaryotes. Over the past decade, phosphorylation site prediction has emerged as an important problem in the field of bioinformatics. Here, we report a new method, termed Random Forest-based Phosphosite predictor 2.0 (RF-Phos 2.0), to predict phosphorylation sites given only the primary amino acid sequence of a protein as input. RF-Phos 2.0, which uses random forest with sequence and structural features, is able to identify putative sites of phosphorylation across many protein families. In side-by-side comparisons based on 10-fold cross validation and an independent dataset, RF-Phos 2.0 compares favorably to other popular mammalian phosphosite prediction methods, such as PhosphoSVM, GPS2.1, and Musite.
biotechnology & applied microbiology,medicine, research & experimental
What problem does this paper attempt to address?