Equilibrium and Non-equilibrium Ensemble Methods for Accurate, Precise and Reproducible Absolute Binding Free Energy Calculations

Agastya Prakash Bhati,Shunzhou Wan,Peter V. Coveney
DOI: https://doi.org/10.26434/chemrxiv-2024-sslzp
2024-07-11
Abstract:Free energy calculations for protein-ligand complexes have become widespread in recent years owing to several conceptual, methodological and technological advances. Central among these is the use of ensemble methods which permits accurate, precise and reproducible predictions. Abso- lute binding free energies (ABFEs) are challenging to predict using alchemical methods and their routine application in drug discovery has remained out of reach until now. Here, we apply en- semble alchemical ABFE methods to a large dataset comprising 219 ligand-protein complexes and obtain statistically robust results with high accuracy (< 1 kcal/mol). We compare equilibrium and non-equilibrium methods for ABFE predictions at large scale and provide a systematic critical as- sessment of each method. The equilibrium method is more accurate, precise, faster, computationally more cost-effective and requires a much simpler protocol, making it preferable for large scale and blind applications. We find that the calculated free energy distributions are non-normal and discuss the consequences. We recommend a definitive protocol to perform ABFE calculations optimally. Using this protocol, it is possible to perform thousands of ABFE calculations within a few hours on modern exascale machines.
Chemistry
What problem does this paper attempt to address?
This paper discusses methods for accurately, precisely, and reproducibly calculating the absolute binding free energy (ABFE) in protein-ligand complexes. Currently, due to advancements in concepts, methods, and technologies, free energy calculations have become increasingly common. However, predicting ABFE using alchemical approaches remains challenging and has not yet been achieved in routine applications in drug discovery. In this study, the authors applied two methods, the equilibrium method and the nonequilibrium method, to predict ABFE on a large scale and systematically evaluated these two methods. They found that the equilibrium method performs better in terms of accuracy, precision, speed, computational cost-effectiveness, and simplicity of the protocol, making it more suitable for large-scale and blind applications. Additionally, the study revealed that the calculated free energy distribution is non-Gaussian and discussed its consequences. The paper proposes a recommended protocol to optimize the execution of ABFE calculations. With this protocol, thousands of ABFE calculations can be completed within a few hours on modern exascale machines. Overall, the paper aims to address how to improve and compare different computational methods used for predicting protein-ligand binding free energy to enhance the prediction accuracy in the drug discovery process.