Substitution models of protein evolution with selection on enzymatic activity
David Ferreiro,Ruqaiya Khalil,Sergio F Sousa,Miguel Arenas
DOI: https://doi.org/10.1093/molbev/msae026
IF: 10.7
2024-02-06
Molecular Biology and Evolution
Abstract:Substitution models of evolution are necessary for diverse evolutionary analyses including phylogenetic tree and ancestral sequence reconstructions. At the protein level, empirical substitution models are traditionally used due to their simplicity, but they ignore the variability of substitution patterns among protein sites. Next, in order to improve the realism of the modeling of protein evolution, a series of structurally constrained substitution models were presented, but still they usually ignore constraints on the protein activity. Here we present a substitution model of protein evolution with selection on both protein structure and enzymatic activity, and that can be applied to phylogenetics. In particular, the model considers the binding affinity of the enzyme-substrate complex as well as structural constraints, that include the flexibility of structural flaps, hydrogen bonds, amino acids backbone radius of gyration and solvent-accessible surface area, that are quantified through molecular dynamics simulations. We applied the model to the HIV-1 protease and evaluated it by phylogenetic likelihood in comparison with the best-fitting empirical substitution model and a structurally constrained substitution model that ignores the enzymatic activity. We found that accounting for selection on the protein activity improves the fitting of the modeled functional regions with the real observations, especially in data with high molecular identity, which recommends considering constraints on the protein activity in the development of substitution models of evolution.
genetics & heredity,biochemistry & molecular biology,evolutionary biology