Genetics, energetics and allostery during a billion years of hydrophobic protein core evolution

Albert Escobedo, Gesa Voigt, Andre J Faure, Ben Lehner
DOI: https://doi.org/10.1101/2024.05.11.593672
2024-05-12
Abstract: Protein folding is driven by the burial of hydrophobic amino acids in a tightly-packed core that excludes water. The genetics, biophysics and evolution of hydrophobic cores are not well understood, in part because of a lack of systematic experimental data on sequence combinations that do - and do not - constitute stable and functional cores. Here we randomize protein hydrophobic cores and evaluate their stability and function at scale. The data show that vast numbers of amino acid combinations can constitute stable protein cores but that these alternative cores frequently disrupt protein function because of allosteric effects. These strong allosteric effects are not due to complicated, highly epistatic fitness landscapes but rather to the pervasive nature of allostery, with many individually small energy changes combining to disrupt function. Indeed both protein stability and ligand binding can be accurately predicted over very large evolutionary distances using additive energy models with a small contribution from pairwise energetic couplings. As a result, energy models trained on one protein can accurately predict core stability across hundreds of millions of years of protein evolution, with only rare energetic couplings that we experimentally identify limiting the transplantation of cores between highly diverged proteins. Our results reveal the simple energetic architecture of protein hydrophobic cores and suggest that allostery is a major constraint on sequence evolution.
Biophysics
What problem does this paper attempt to address?
This paper examines the genetics, energetics, and allosteric consequences of sequence variation in protein hydrophobic cores across roughly a billion years of evolution. The researchers randomized the hydrophobic cores of proteins and measured stability and function at scale. They found that vast numbers of amino acid combinations can form stable cores, but that these alternative cores frequently disrupt protein function through allosteric effects. The strong allosteric effects do not arise from a complicated, highly epistatic fitness landscape; they arise because allostery is pervasive, so many individually small energy changes combine to impair function. Both protein stability and ligand binding can be predicted accurately over very large evolutionary distances by additive energy models with a small contribution from pairwise energetic couplings. Energy models trained on one protein therefore predict core stability across hundreds of millions of years of evolution, and only rare energetic couplings limit the transplantation of cores between highly diverged proteins. Although most core mutations reduce stability, a large number of variants carrying multiple mutations remain stable. The experiments also show that the energetic effects of many core mutations propagate to the protein surface. Overall, the paper aims to relate the evolution, structure, and function of protein hydrophobic cores: the data indicate that the genetic and energetic architecture of the core is simpler than expected and that allostery is a major constraint on sequence evolution.
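To make the idea of an additive energy model with pairwise couplings concrete, here is a minimal sketch in Python. It is not the authors' fitted model: the two-state folding assumption, the temperature, the mutation labels, and every energy value are hypothetical and chosen only for illustration. A variant's folding free energy is taken as the wild-type energy plus the sum of per-mutation terms, optionally corrected by pairwise coupling terms, and is then converted to a fraction of folded molecules with a Boltzmann factor.

```python
import math
from itertools import combinations

R = 1.987e-3   # gas constant in kcal/(mol*K)
T = 303.0      # assumed temperature in kelvin (hypothetical)


def folding_energy(variant, dg_wt, additive, couplings=None):
    """Folding free energy of a variant (kcal/mol; negative means stable).

    variant   -- iterable of mutation labels present in the variant
    dg_wt     -- wild-type folding free energy
    additive  -- dict: mutation label -> additive free energy change
    couplings -- optional dict: sorted (label_a, label_b) tuple -> coupling energy
    """
    muts = sorted(variant)
    dg = dg_wt + sum(additive[m] for m in muts)
    if couplings:
        # Add a correction for every pair of mutations that has a coupling term.
        for pair in combinations(muts, 2):
            dg += couplings.get(pair, 0.0)
    return dg


def fraction_folded(dg):
    """Two-state Boltzmann model: fraction of molecules in the folded state."""
    return 1.0 / (1.0 + math.exp(dg / (R * T)))


# Hypothetical parameters, for illustration only.
DG_WT = -2.0
ADDITIVE = {"V10I": 0.4, "L25F": 0.7, "A39V": -0.2}
COUPLINGS = {("L25F", "V10I"): 0.5}

if __name__ == "__main__":
    variant = {"V10I", "L25F"}
    dg_add = folding_energy(variant, DG_WT, ADDITIVE)
    dg_pair = folding_energy(variant, DG_WT, ADDITIVE, COUPLINGS)
    print(f"additive only:  dG = {dg_add:.2f}, folded = {fraction_folded(dg_add):.3f}")
    print(f"with couplings: dG = {dg_pair:.2f}, folded = {fraction_folded(dg_pair):.3f}")
```

In this toy setting the additive terms carry most of the predicted energy and the coupling dictionary is sparse, which mirrors the abstract's description of a small contribution from pairwise couplings. The nonlinear step from energy to fraction folded is why individually small energy changes can add up to a large change in the measured phenotype.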