Abstract: Post-translational modifications (PTMs) are fundamental to essential biological processes, influencing gene expression, protein localization, stability, and genome replication. Sumoylation, a PTM in which a small ubiquitin-like modifier (SUMO) protein is covalently attached to lysine residues of a target protein, profoundly affects the functional diversity of proteins. Identifying sumoylation sites has attracted considerable attention because of their roles in proteomic functions and their implications in various diseases, including Parkinson's and Alzheimer's. Several computational models have been proposed for identifying sumoylation sites, but their effectiveness is constrained by the limitations of conventional learning methodologies. In this study, we introduce PSSM-Sumo, a computational model for predicting sumoylation sites that combines an optimized deep learning algorithm with feature extraction based on the pseudo-position-specific scoring matrix (PsePSSM). To reduce computational cost and eliminate irrelevant and noisy features, sequential forward selection using a support vector machine (SFS-SVM) is applied to identify the optimal feature subset. A multi-layer Deep Neural Network (DNN) serves as the classifier for sumoylation site prediction. We assess the performance of PSSM-Sumo through tenfold cross-validation, using the Matthews Correlation Coefficient (MCC), accuracy, sensitivity, specificity, and the Area under the ROC Curve (AUC) as evaluation metrics. Comparative analyses show that PSSM-Sumo achieves an average prediction accuracy of 98.71%, surpassing existing models. The robustness and accuracy of the proposed model make it a promising tool for drug discovery and for the diagnosis of diseases linked to sumoylation sites.
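The threshold-based metrics named in the abstract (accuracy, sensitivity, specificity, and MCC) can all be derived from the four confusion-matrix counts of a binary site classifier. The following is a minimal sketch using the standard definitions, not the authors' evaluation code; the function name `binary_metrics` and the example counts are hypothetical and chosen only for illustration.

```python
import math


def binary_metrics(tp, tn, fp, fn):
    """Compute standard binary-classification metrics from confusion counts.

    tp/fn: sumoylation sites predicted correctly / missed.
    tn/fp: non-sites predicted correctly / wrongly flagged as sites.
    """
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0
    # Sensitivity: fraction of true sites that were recovered.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: fraction of non-sites that were correctly rejected.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    # MCC: correlation between predicted and true labels, in [-1, 1].
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }


# Hypothetical counts for one cross-validation fold (illustrative only).
fold = binary_metrics(tp=90, tn=85, fp=15, fn=10)
print(fold)
```

In a tenfold cross-validation setting, such a function would be applied to each held-out fold and the per-fold values averaged. AUC is omitted here because it requires ranked prediction scores rather than confusion counts.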

SIMLIN: a Bioinformatics Tool for Prediction of S-sulphenylation in the Human Proteome Based on Multi-Stage Ensemble-Learning Models

SulSite-GTB: Identification of Protein S-sulfenylation Sites by Fusing Multiple Feature Information and Gradient Tree Boosting

DLF-Sul: a Multi-Module Deep Learning Framework for Prediction of S-sulfinylation Sites in Proteins

DeepCSO: a Deep-Learning Network Approach to Predicting Cysteine S-sulphenylation Sites

Sohpred: A New Bioinformatics Tool For The Characterization And Prediction Of Human S-Sulfenylation Sites

Accurate in Silico Identification of Protein Succinylation Sites Using an Iterative Semi-Supervised Learning Technique

Prediction of Protein S-Sulfenylation Sites Using a Deep Belief Network

PredSulSite: Prediction of Protein Tyrosine Sulfation Sites with Multiple Features and Analysis

PredCSO: an Ensemble Method for the Prediction of S-sulfenylation Sites in Proteins

Predicting the Protein SUMO Modification Sites Based on Properties Sequential Forward Selection (PSFS)

Deep Learning Based Prediction of Species-Specific Protein S-glutathionylation Sites

A novel method for high accuracy sumoylation site prediction from protein sequences

SuccSPred2.0: A Two-Step Model to Predict Succinylation Sites Based on Multifeature Fusion and Selection Algorithm

Detecting Succinylation Sites from Protein Sequences Using Ensemble Support Vector Machine

SSKM_Succ: A Novel Succinylation Sites Prediction Method Incorporating K-Means Clustering with a New Semi-Supervised Learning Algorithm

Systematic Study of Protein Sumoylation: Development of a Site-Specific Predictor of SUMOsp 2.0

SLAM: Structure-aware lysine β-hydroxybutyrylation prediction with protein language model

An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites

PSSM-Sumo: deep learning based intelligent model for prediction of sumoylation sites using discriminative features

A Novel Method for Predicting Post-Translational Modifications on Serine and Threonine Sites by Using Site-Modification Network Profiles

SulfoTyrP: A High Accuracy Predictor of Protein Sulfotyrosine Sites