Do all roads lead to Rome? Convergence issues in umbrella sampling simulations

Pavel Buslaev, Noora Aho, Gerrit Groenhof
DOI: https://doi.org/10.26434/chemrxiv-2023-2pqls
2023-11-30
Abstract: Molecular dynamics (MD) simulations are widely applied to estimate absolute binding free energies of protein-ligand and protein-protein complexes. A routinely used method for binding free energy calculations with MD is umbrella sampling (US), which calculates the potential of mean force (PMF) along a reaction coordinate. In this work, we investigate the convergence of US along standard distance-based reaction coordinates for various protein-protein and protein-ligand complexes, following commonly used guidelines for the setup. We show that repeating the complete US workflow can lead to differences of 2-20 kcal/mol in computed binding free energies. We attribute those discrepancies to small differences in the binding pathways. We then demonstrate that adaptive-biasing approaches, which are constructed to sample multiple pathways in a single simulation, such as the accelerated weight histogram (AWH) method, can achieve convergence between independent simulations. To the best of our knowledge, this is the first attempt to systematically assess the shortcomings of the widely accepted protocols for US of protein-protein and protein-ligand binding affinities. We therefore anticipate that our results will provide an incentive for a critical reassessment of the validity of PMFs computed with US, as well as for the adoption of adaptive-biasing approaches for computing binding affinities.
Chemistry
What problem does this paper attempt to address?
The paper addresses the convergence of umbrella sampling (US), a routinely used method for computing binding free energies of protein-protein and protein-ligand complexes from molecular dynamics (MD) simulations. The authors find that, even when commonly used setup guidelines are followed, repeating the complete US workflow can change the computed binding free energy by 2-20 kcal/mol, and they attribute these discrepancies to small differences in the binding pathways. They further show that adaptive-biasing approaches such as the accelerated weight histogram (AWH) method, which are constructed to sample multiple pathways in a single simulation, can achieve convergence between independent simulations. The study therefore stresses the importance of sampling multiple pathways when computing binding affinities and calls for a critical reassessment of potentials of mean force (PMFs) obtained from US.
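To make the notion of a discrepancy between repeated workflows concrete, here is a minimal sketch of how the spread between independent PMF replicas could be quantified. It is not the authors' analysis: the function names (`pmf_depth`, `replica_spread`), the Gaussian-well toy profiles, and the `bulk_start` cutoff are all hypothetical, and the numbers are synthetic rather than taken from the paper. The well depth is used only as a crude proxy; a real binding free energy would also need volume and standard-state corrections, which are omitted here.

```python
import numpy as np


def pmf_depth(xi, pmf, bulk_start):
    """Depth of the PMF minimum relative to the unbound plateau.

    xi         : reaction-coordinate values (e.g. a distance in nm)
    pmf        : PMF values on that grid (e.g. in kcal/mol)
    bulk_start : hypothetical cutoff; points with xi >= bulk_start are
                 treated as the unbound plateau and averaged
    """
    xi = np.asarray(xi, dtype=float)
    pmf = np.asarray(pmf, dtype=float)
    plateau = pmf[xi >= bulk_start].mean()
    return pmf.min() - plateau


def replica_spread(depths):
    """Largest difference between the depth estimates of the replicas."""
    depths = np.asarray(depths, dtype=float)
    return depths.max() - depths.min()


# Synthetic stand-ins for three independent repeats of the same workflow.
# Each toy PMF is a Gaussian well centred at xi = 1.0 that decays to zero.
xi = np.linspace(0.5, 3.0, 251)


def toy_pmf(depth):
    return -depth * np.exp(-((xi - 1.0) / 0.3) ** 2)


depths = [pmf_depth(xi, toy_pmf(d), bulk_start=2.5) for d in (10.0, 14.0, 18.0)]
spread = replica_spread(depths)
print(f"well depths: {np.round(depths, 2)}, spread: {spread:.2f} kcal/mol")
```

With these made-up profiles the spread is about 8 kcal/mol, which falls inside the 2-20 kcal/mol range reported for repeated US workflows. In practice the same two functions would be applied to PMFs obtained from independent runs on a common grid.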