Does Machine Learning Learn the Physics for Pose Ranking of Fragment-Sized Ligands? A Comparison between Machine Learning and Physics-based Methods

xiao wan,Bai Xue,Michael Bellucci,Zhixiong Lin,Mingjun Yang,Junjie Zou
DOI: https://doi.org/10.26434/chemrxiv-2024-020gt
2024-05-24
Abstract:In fragment-based drug discovery using in silico methods, predicting the binding pose is a crucial step to ensure the accurate prediction of binding affinities. Recent studies have focused on the challenges of docking fragments compared to drug-like molecules, with findings suggesting that more sophisticated scoring functions can improve the accuracy of identifying correct binding poses. In this work, we conducted extensive ABFEP (Alchemical Binding Free Energy Perturbation) calculations on a fragment benchmarking dataset to evaluate the accuracy of ABFEP in ranking binding poses of fragments and compared ABFEP rescoring with Vina and two machine learning (ML)-based scoring functions. Indeed, ABFEP, which has a theoretically more rigorous scoring function, significantly outperforms Vina. In-depth comparison between ML-based scoring functions and ABFEP shows that ML-based scoring functions behave similarly to ABFEP on the prediction accuracy and failed cases, indicating that ML is capable of increasing the prediction accuracy over traditional scoring functions through learning the underlying physics rather than memorizing the coordinates in the training data.
Chemistry
What problem does this paper attempt to address?
This paper discusses the problem of predicting binding conformations of fragment-sized ligands in drug discovery. The study evaluates the binding conformations of ligands by comparing machine learning (ML) methods with physics-based methods such as Alchemical Binding Free Energy Perturbation (ABFEP). The paper points out that although ABFEP theoretically has a more rigorous scoring function, it significantly outperforms traditional methods like Vina in ranking fragment binding conformations. Furthermore, machine learning scoring functions show similar performance to ABFEP in terms of prediction accuracy and failure cases, indicating that ML can improve prediction accuracy by learning the underlying physical principles of protein-fragment binding instead of simply memorizing the coordinates in training data. In this study, the authors used a benchmark dataset comprising 93 high-quality protein-fragment complexes and conducted extensive ABFEP calculations to evaluate its ability to distinguish correct and incorrect binding conformations. The results demonstrate that ABFEP outperforms Vina for fragments with fewer hydrogen bonds formed with the receptor. The paper also analyzes the performance of machine learning scoring functions in systems with experimental uncertainty and finds that their prediction accuracy in these cases is similar to that of ABFEP, further demonstrating the potential of ML in learning physical rules. The paper also discusses the reasons for the failures of ABFEP in certain cases and provides improvement strategies, as well as emphasizes the importance of hydrogen bonds in identifying the correct binding conformations. Through these analyses, the study provides insights into improving the accuracy of docking scoring.