Abstract:Background Essential proteins are crucial for cellular life and thus, identification of essential proteins is an important topic and a challenging problem for researchers. Recently lots of computational approaches have been proposed to handle this problem. However, traditional centrality methods cannot fully represent the topological features of biological networks. In addition, identifying essential proteins is an imbalanced learning problem; but few current shallow machine learning-based methods are designed to handle the imbalanced characteristics. Results We develop DeepEP based on a deep learning framework that uses the node2vec technique, multi-scale convolutional neural networks and a sampling technique to identify essential proteins. In DeepEP, the node2vec technique is applied to automatically learn topological and semantic features for each protein in protein-protein interaction (PPI) network. Gene expression profiles are treated as images and multi-scale convolutional neural networks are applied to extract their patterns. In addition, DeepEP uses a sampling method to alleviate the imbalanced characteristics. The sampling method samples the same number of the majority and minority samples in a training epoch, which is not biased to any class in training process. The experimental results show that DeepEP outperforms traditional centrality methods. Moreover, DeepEP is better than shallow machine learning-based methods. Detailed analyses show that the dense vectors which are generated by node2vec technique contribute a lot to the improved performance. It is clear that the node2vec technique effectively captures the topological and semantic properties of PPI network. The sampling method also improves the performance of identifying essential proteins. Conclusion We demonstrate that DeepEP improves the prediction performance by integrating multiple deep learning techniques and a sampling method. DeepEP is more effective than existing methods.

DeepPD: A Deep Learning Method for Predicting Peptide Detectability Based on Multi-feature Representation and Information Bottleneck

Deep2Pep: A Deep Learning Method in Multi-label Classification of Bioactive Peptide

DeepIso: A Deep Learning Model for Peptide Feature Detection

Protein-peptide binding residue prediction based on protein language models and cross-attention mechanism

DeepIso: A Deep Learning Model for Peptide Feature Detection from LC-MS Map

DeepPepPI: A deep cross-dependent framework with information sharing mechanism for predicting plant peptide-protein interactions

DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies.

Deep Neural Network for Detecting Arbitrary Precision Peptide Features Through Attention Based Segmentation

Deep Learning-Based Multi-Functional Therapeutic Peptides Prediction with a Multi-Label Focal Dice Loss Function.

Predicting Protein-Peptide Binding Residues Via Interpretable Deep Learning.

PTPD: Predicting Therapeutic Peptides by Deep Learning and Word2vec

Predicting Protein Interactions Using a Deep Learning Method-Stacked Sparse Autoencoder Combined with a Probabilistic Classification Vector Machine.

DeepTPpred: A Deep Learning Approach with Matrix Factorization for Predicting Therapeutic Peptides by Integrating Length Information

DeepEP: a Deep Learning Framework for Identifying Essential Proteins.

An Integration of Deep Learning with Feature Embedding for Protein–protein Interaction Prediction

Deep Metric Learning for Proteomics Deep Metric Learning for Proteomics

DeepPPI: Boosting Prediction of Protein-Protein Interactions with Deep Neural Networks.

Pdeep: Predicting MS/MS Spectra of Peptides with Deep Learning.

MMDB: Multimodal Dual-Branch Model for Multi-Functional Bioactive Peptide Prediction

Protein-DNA Binding Residues Prediction Using a Deep Learning Model with Hierarchical Feature Extraction

Anticancer peptides prediction with deep representation learning features