Abstract: The study of protein-protein interactions (PPIs) is important for understanding protein function. However, investigating transient protein-protein interactions experimentally remains a challenge, so computational prediction of PPIs is drawing growing attention. Statistics-based features have been widely used in studies of protein structure prediction and protein folding. Because experimental PPI data are scarce, it is difficult to construct a conventional statistical feature for PPI prediction, and statistics-based features have seen very limited use in this field. In this paper, we explore the application of frustration, a statistical potential, to PPI prediction. By comparing the extra stabilization energy contributed by a given residue pair in the native protein with the statistics of the energies, we obtained the residue pair's frustration index. By counting the residue pairs with a high frustration index, we then obtained the highly frustrated density, a residue-frustration-based feature that describes the tendency of residues to be involved in PPIs. Highly frustrated density, together with structure-based features, was used to describe protein residues and combined with a long short-term memory (LSTM) neural network to predict PPI residue pairs. Our model made correct predictions for 75% of dimers when only the top 2 per thousand (2‰) of residue pairs were selected in each dimer. Because it relies on statistics-based features, our model differs significantly from models based on the chemical features of residues. We found that frustration can effectively describe the tendency of residues to be involved in PPIs. Frustration-based features can replace chemical features in combination with machine learning and achieve better PPI prediction performance. These results reveal the great potential of statistical potentials such as frustration in PPI prediction.
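
The two quantities described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the frustration index is a z-score of the native pair energy against a set of reference (decoy) energies, and that a pair counts as highly frustrated when its index is at or below a cutoff of -1. The sign convention, the cutoff, the normalization of the density as a fraction of a residue's pairs, and the names `frustration_index` and `highly_frustrated_density` are all assumptions introduced here for illustration.

```python
from statistics import mean, pstdev


def frustration_index(native_energy, decoy_energies):
    """Z-score of the native pair energy against reference (decoy) energies.

    Assumed convention: a negative value means the native pair is less
    stable than the average decoy, i.e. more frustrated.
    """
    mu = mean(decoy_energies)
    sigma = pstdev(decoy_energies)
    if sigma == 0:
        raise ValueError("decoy energies have zero variance")
    return (mu - native_energy) / sigma


def highly_frustrated_density(pair_indices, residue, cutoff=-1.0):
    """Fraction of a residue's pairs that are highly frustrated.

    pair_indices maps (i, j) residue pairs to their frustration index.
    The cutoff of -1.0 is an assumed threshold, not taken from the paper.
    """
    own = [f for (i, j), f in pair_indices.items() if residue in (i, j)]
    if not own:
        return 0.0
    return sum(1 for f in own if f <= cutoff) / len(own)


if __name__ == "__main__":
    # Toy numbers only, not experimental data.
    pairs = {(0, 1): -2.0, (0, 2): 0.5, (1, 2): -1.5}
    for res in range(3):
        print(res, highly_frustrated_density(pairs, res))
```

In a pipeline like the one the abstract outlines, a per-residue value of this kind would be concatenated with structure-based features before being passed to the sequence model.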

Protein-protein Interaction Extraction Based on Self-Training

Protein-protein interaction extraction from bio-literature with compact features and data sampling strategy

Deep Learning Frameworks for Protein–protein Interaction Prediction

DeepTrio: a Ternary Prediction System for Protein-Protein Interaction Using Mask Multiple Parallel Convolutional Neural Networks

A hybrid method for extraction of protein-protein interactions from literature

Extracting Protein-Protein Interactions (PPIs) from Biomedical Literature using Attention-based Relational Context Information

SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Efficient and Generalizable Protein-Protein Interaction Prediction

Residue-Frustration-Based Prediction of Protein-Protein Interactions Using Machine Learning

Effective Protein-Protein Interaction Exploration with PPIretrieval

Protein-protein interaction prediction via structure-based deep learning

Automatic Extraction of Protein Interaction in Literature

Independence in Possibility Theory under Different Triangular Norms

Coelomomyces opifexi (Pillai & Smith). Coelomomycetaceae: Blastocladiales II. Experiments in sporangial germination

Protein Complexes Prediction Via Positive and Unlabeled Learning of the PPI Networks

Advances in Computational Methods for Protein–Protein Interaction Prediction

SDNN-PPI: self-attention with deep neural network effect on protein-protein interaction prediction

Improving protein-protein interaction site prediction using deep residual neural network

Protein-Protein Interactions Prediction Based on Bi-directional Gated Recurrent Unit and Multimodal Representation

Prediction Of Protein-Protein Interactions Using Subcellular And Functional Localizations

Learning the protein language of proteome-wide protein-protein binding sites via explainable ensemble deep learning

DeepPPI: Boosting Prediction of Protein-Protein Interactions with Deep Neural Networks.