Computational Prediction of Species-Specific Malonylation Sites Via Enhanced Characteristic Strategy.

Li-Na Wang,Shao-Ping Shi,Hao-Dong Xu,Ping-Ping Wen,Jian-Ding Qiu
DOI: https://doi.org/10.1093/bioinformatics/btw755
IF: 5.8
2016-01-01
Bioinformatics
Abstract:Motivation: Protein malonylation is a novel post-translational modification (PTM) which orchestrates a variety of biological processes. Annotation of malonylation in proteomics is the first-crucial step to decipher its physiological roles which are implicated in the pathological processes. Comparing with the expensive and laborious experimental research, computational prediction can provide an accurate and effective approach to the identification of many types of PTMs sites. However, there is still no online predictor for lysine malonylation.Results: By searching from literature and database, a well-prepared up-to-data benchmark datasets were collected in multiple organisms. Data analyses demonstrated that different organisms were preferentially involved in different biological processes and pathways. Meanwhile, unique sequence preferences were observed for each organism. Thus, a novel malonylation site online prediction tool, called MaloPred, which can predict malonylation for three species, was developed by integrating various informative features and via an enhanced feature strategy. On the independent test datasets, AUC (area under the receiver operating characteristic curves) scores are obtained as 0.755, 0.827 and 0.871 for Escherichia coli (E. coli), Mus musculus (M. musculus) and Homo sapiens (H. sapiens), respectively. The satisfying results suggest that MaloPred can provide more instructive guidance for further experimental investigation of protein malonylation.Supplementary information: Supplementary data are available at Bioinformatics online.
What problem does this paper attempt to address?