Abstract:

MOTIVATION: Mutations in the human genome arise mainly through single-nucleotide polymorphisms, some of which affect the stability and function of proteins and thereby cause human diseases. Several methods have been proposed to predict the effect of mutations on protein stability, but most require features derived from experimental structures. Given the fast progress in protein structure prediction, this work explores whether low-resolution structure modeling can improve the prediction of mutation-induced stability changes.

RESULTS: We developed a new method (STRUM) for predicting stability changes caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by iterative threading assembly refinement (I-TASSER) simulations; physics- and knowledge-based energy functions are then derived from the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross-validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between the predicted and measured changes of the Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79, with a root-mean-square error of 1.2 kcal/mol, in the mutation-based cross-validations. The PCC decreases when training and test mutations are drawn from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms computed on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy depends only marginally on the accuracy of the protein structure models, as long as the global fold is correct. These data demonstrate the feasibility of using low-resolution structure modeling for high-accuracy prediction of stability changes upon point mutations.

AVAILABILITY AND IMPLEMENTATION: http://zhanglab.ccmb.med.umich.edu/STRUM/

CONTACT: qiang@suda.edu.cn and zhng@umich.edu

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
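The two evaluation metrics quoted in the abstract (PCC and root-mean-square error), and the difference between a mutation-based and a protein-based fold assignment, can be illustrated with a minimal sketch. This is not the STRUM code: the function names, the toy ΔΔG values, and the round-robin fold assignment are assumptions made for illustration, and grouping by protein identifier alone does not guarantee that proteins in different folds are non-homologous.

```python
import math


def pearson(pred, meas):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(pred)
    mean_p = sum(pred) / n
    mean_m = sum(meas) / n
    cov = sum((p - mean_p) * (m - mean_m) for p, m in zip(pred, meas))
    var_p = sum((p - mean_p) ** 2 for p in pred)
    var_m = sum((m - mean_m) ** 2 for m in meas)
    return cov / math.sqrt(var_p * var_m)


def rmse(pred, meas):
    """Root-mean-square error, in the units of the inputs (here kcal/mol)."""
    return math.sqrt(sum((p - m) ** 2 for p, m in zip(pred, meas)) / len(pred))


def mutation_folds(n_items, k=5):
    """Mutation-based split: mutation i goes to fold i mod k.

    Mutations from the same protein can land in both training and test folds.
    """
    return [i % k for i in range(n_items)]


def protein_folds(protein_ids, k=5):
    """Protein-based split: all mutations of one protein share a fold.

    Proteins are assigned to folds round-robin in order of first appearance
    (an illustrative choice, not taken from the paper).
    """
    fold_of = {}
    for pid in protein_ids:
        if pid not in fold_of:
            fold_of[pid] = len(fold_of) % k
    return [fold_of[pid] for pid in protein_ids]


# Hypothetical toy data: measured and predicted ddG values in kcal/mol.
measured = [-1.0, 0.5, -2.0, 1.0]
predicted = [-0.5, 0.0, -1.5, 1.5]
proteins = ["A", "A", "B", "C", "B"]  # hypothetical protein identifiers

print("PCC :", pearson(predicted, measured))
print("RMSE:", rmse(predicted, measured))
print("mutation-based folds:", mutation_folds(len(proteins), k=2))
print("protein-based folds :", protein_folds(proteins, k=2))
```

In the protein-based assignment no protein contributes mutations to both sides of a split, which is the kind of separation under which the abstract reports a lower PCC.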

STRUM: structure-based prediction of protein stability changes upon single-point mutation
