Abstract:Abstract Understanding the effects of missense mutations on protein stability is a widely acknowledged significant biological problem. Genomic missense mutations may alter one or more amino acids leading to increased or decreased stability of the encoded proteins. In this study, we propose a novel approach - Protein Stability Prediction with a Gaussian Network Model (PSP-GNM) to study the effect of single amino acid substitutions on protein stability. Specifically, PSP-GNM employs a coarse-grained Gaussian Network Model (GNM) that has interactions between amino acids weighted by the Miyazawa-Jernigan (MJ) statistical potential. We use PSP-GNM to simulate partial unfolding of the wildtype and mutant structures and then, use the difference in energies of the unfolded wildtype and mutant protein structures to estimate the experimentally obtained unfolding free energy change (ΔΔG). We verify the extent of correspondence between the ΔΔG calculated by PSP-GNM and the ΔΔG obtained experimentally using three datasets: 350 forward mutations from 66 proteins, 2298 forward mutations from 126 proteins and 611 forward and reverse mutations from 66 proteins and observe Pearson correlation coefficient (PCC) as high as 0.58 and root mean-squared error (RMSE) as low as 1.24 kcal/mol. The performance is comparable to the existing state of the art methods. Importantly, we do observe an increase in the correlation to 0.73 and decrease in RMSE to 1.07 when considering only those measurements made close to 25°C and neutral pH, suggesting a strong dependence on temperature and pH. PSP-GNM is written in Python and is available as a free downloadable package at https://github.com/sambitmishra0628/PSP-GNM . Author Summary Understanding how genomic missense mutations impact the thermodynamic stability of encoded proteins is important to understand disease etiology. Specifically, mutant proteins are often functionally inactive and underlie numerous genetic and neurodegenerative diseases. A classic example is sickle cell anemia – a single amino acid change significantly affects hemoglobin’s binding affinity for oxygen. To be able to identify mutations that would likely impact the protein function and stability is therefore essential and is the focus of our study. We present an approach that relies on utilizing the intrinsic dynamics of protein structures to predict the effect of single amino acid mutations (point mutations) on protein stability. In our approach, we model proteins as coarse-grained beads (amino acids) and springs (interactions), simulate protein unfolding and identify putative residue-residue contacts that are broken during the unfolding process. We demonstrate that the knowledge of broken contacts and their order is essential in describing the thermodynamic differences between wildtype and mutant proteins. We also highlight the importance of residue-residue interactions at the mutation site in the context of protein stability prediction. Our findings present novel avenues to interpret how genomic mutations may manifest in the encoded proteins.

Exploring evolution to uncover insights into protein mutational stability

Assessing the Performance of Computational Predictors for Estimating Protein Stability Changes Upon Missense Mutations

Predicting a Protein's Stability under a Million Mutations

Protein stability models fail to capture epistatic interactions of double point mutations

Correspondence between functional scores from deep mutational scans and predicted effects on protein stability

Assessing computational tools for predicting protein stability changes upon missense mutations using a new dataset

AI challenges for predicting the impact of mutations on protein stability

Structure-based Prediction of the Effects of a Missense Variant on Protein Stability.

Protein Stability Changes upon Point Mutations Identified with a Gaussian Network Model Simulating Protein Unfolding Behavior

Biophysical inference of epistasis and the effects of mutations on protein stability and function

Comparison and evaluation of data-driven protein stability prediction models

Protein stability prediction by fine-tuning a protein language model on a mega-scale dataset

Combining Network Topological Characteristics With Sequence And Structure Based Features For Predicting Protein Stability Changes Upon Single Amino Acid Mutation

Comparing Supervised Learning and Rigorous Approach for Predicting Protein Stability upon Point Mutations in Difficult Targets

Predicting Protein Thermostability Upon Mutation Using Molecular Dynamics Timeseries Data

Deep Virtual Compton Scattering and the Nucleon Generalized Parton Distributions

Three Simple Properties Explain Protein Stability Change upon Mutation

Enhancing predictions of protein stability changes induced by single mutations using MSA-based language models

Predicting protein stability changes upon single-point mutation: a thorough comparison of the available tools on a new dataset

STRUM: structure-based prediction of protein stability changes upon single-point mutation