Deepmal: Accurate Prediction Of Protein Malonylation Sites By Deep Neural Networks
Minghui Wang,Xiaowen Cui,Shan Li,Xinhua Yang,Anjun Ma,Yusen Zhang,Bin Yu
DOI: https://doi.org/10.1016/j.chemolab.2020.104175
IF: 4.175
2020-01-01
Chemometrics and Intelligent Laboratory Systems
Abstract:Lysine malonylation is one of the important post-translational modification (PTM) of proteins. Malonylated proteins can affect various cell functions of eukaryotes and prokaryotes, which play an important role in metabolic pathways, mitochondrial functions, fatty acid oxidation and other life activities. However, accurate identification of the malonylation sites is the key to further understand the molecular mechanism of malonylation. Currently, experimental identification is still a challenging task, which usually requires a large amount of laboratory work and considerable cost. Regarding this situation, there is an urgent need to establish useful calculation methods and develop effective predictors. We propose a new deep learning network model called DeepMal. Firstly, features are extracted by enhanced amino acid composition (EAAC), enhanced grouped amino acid composition (EGAAC), dipeptide deviation from the expected mean (DDE), K nearest neighbors (KNN) and BLOSUM62 matrix. Secondly, the linear convolutional neural network is used to extract the specific features of malonylation sites, select the relevant features and reduce the feature dimension through maximum pooling. Finally, malonylation sites and non-malonylation sites are classified by a multilayer neural network. And an independent dataset is used to assess the predictive ability of the model DeepMal. On the independent datasets Escherichia coli (E. coli), Homo sapiens (H. sapiens), Mus musculus (M. musculus), the AUC values are 0.974, 0.956 and 0.944, and the accuracies are 96.5%, 95.5% and 94.5%, respectively. Compared with other prediction models, the prediction accuracy is increased by 9.5%-18.5%, and this indicates the effectiveness of the prediction model DeepMal. Using deep learning network can enhance the robustness of DeepMal model for predicting malonylation sites.