Abstract: Identifying methylated residues helps us understand the molecular mechanisms of many biological processes. Currently, almost all computational methods for methylation site prediction are based on protein sequences. However, the 3-D structures of proteins are more directly correlated with their biological properties than the sequences are. Because few similar studies have been done, we propose a novel method for predicting protein lysine methylation sites based on single-residue structural features. Unlike previous works, which extract fragments centered on the methylated site and containing several neighboring residues as samples, this paper treats only the single methylated lysine site as a sample. On the basis of the 3-D structures of methylated proteins, we give a comprehensive feature representation for each methylated lysine by combining accessible surface area (ASA), protrusion index (CX), depth index (DPX), secondary structure (SS), residue interaction network (RIN) and electrostatic potential (EP). These features characterize the structural environment of each methylated lysine; in other words, the structural information of the neighboring residues is integrated into the features of the site itself. Our analysis suggests that it is more efficient to build the model on single sites than to add adjacent residues. On the testing set, the prediction model achieved a sensitivity of 95.1% and a specificity of 89.0%. Moreover, a common independent dataset was collected to further evaluate our model against five existing sequence-based methods. The prediction results indicate that our method outperforms them and that all experimentally confirmed methylated sites are successfully identified by our model. Finally, we conducted predictions on a proteomic scale to provide guidance for further experiments. All results indicate that our method can be a useful tool for identifying methylated lysine sites.
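As a rough illustration of the single-site representation and the reported metrics, the sketch below assembles one feature vector per lysine from the six descriptor families named in the abstract and computes sensitivity and specificity from binary predictions. The `LysineSite` container, the three-state secondary structure encoding, the single `rin_degree` descriptor and all numeric values are assumptions made for illustration only; the abstract does not specify how each feature is encoded or which classifier is used.

```python
from dataclasses import dataclass

# Hypothetical three-state secondary structure alphabet (helix, strand, coil).
SS_CLASSES = ("H", "E", "C")


@dataclass
class LysineSite:
    """Hypothetical container for the structural descriptors of one lysine."""
    asa: float         # accessible surface area (ASA)
    cx: float          # protrusion index (CX)
    dpx: float         # depth index (DPX)
    ss: str            # secondary structure (SS) class, one of SS_CLASSES
    rin_degree: float  # one illustrative residue interaction network (RIN) value
    ep: float          # electrostatic potential (EP)


def to_feature_vector(site):
    """Flatten one site into a numeric vector; SS is one-hot encoded."""
    one_hot = [1.0 if site.ss == c else 0.0 for c in SS_CLASSES]
    return [site.asa, site.cx, site.dpx, *one_hot, site.rin_degree, site.ep]


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, 1 = methylated."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


# Made-up example values, not data from the paper.
example_site = LysineSite(asa=120.5, cx=1.2, dpx=0.0, ss="H", rin_degree=4.0, ep=-0.3)
example_vector = to_feature_vector(example_site)

toy_true = [1, 1, 1, 0, 0, 0, 0]
toy_pred = [1, 1, 0, 0, 0, 0, 1]
toy_sens, toy_spec = sensitivity_specificity(toy_true, toy_pred)
```

Each sample here is a single residue, so the vector length stays fixed regardless of how many neighbors a sequence window would have included; the neighbors enter only through the structural descriptors themselves.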

Predicting Protein Lysine Methylation Sites by Incorporating Single-Residue Structural Features into Chou's Pseudo Components
