Uni-Mol Docking V2: Towards Realistic and Accurate Binding Pose Prediction

Eric Alcaide, Zhifeng Gao, Guolin Ke, Yaqi Li, Linfeng Zhang, Hang Zheng, Gengmo Zhou
2024-05-20
Abstract: In recent years, machine learning (ML) methods have emerged as promising alternatives for molecular docking, offering the potential for high accuracy without incurring prohibitive computational costs. However, recent studies have indicated that these ML models may overfit to quantitative metrics while neglecting the physical constraints inherent in the problem. In this work, we present Uni-Mol Docking V2, which demonstrates a remarkable improvement in performance, accurately predicting the binding poses of 77+% of ligands in the PoseBusters benchmark with an RMSD value of less than 2.0 Å, and 75+% passing all quality checks. This represents a significant increase from the 62% achieved by the previous Uni-Mol Docking model. Notably, our Uni-Mol Docking approach generates chemically accurate predictions, circumventing issues such as chirality inversions and steric clashes that have plagued previous ML models. Furthermore, we observe enhanced performance in terms of high-quality predictions (RMSD values of less than 1.0 Å and 1.5 Å) and physical soundness when Uni-Mol Docking is combined with more physics-based methods like Uni-Dock. Our results represent a significant advancement in the application of artificial intelligence for scientific research, adopting a holistic approach to ligand docking that is well-suited for industrial applications in virtual screening and drug design. The code, data and service for Uni-Mol Docking are publicly available for use and further development in
Biomolecules, Machine Learning, Biological Physics
What problem does this paper attempt to address?
This paper addresses two weaknesses of machine learning (ML) models for molecular docking: limited accuracy in binding pose prediction and poor physical plausibility of the predicted poses. Existing ML models may overfit to quantitative metrics while neglecting the physical constraints of the problem. The paper introduces Uni-Mol Docking V2, which predicts the binding poses of over 77% of ligands in the PoseBusters benchmark with an RMSD below 2.0 Å; over 75% of its predictions also pass all quality checks. By comparison, the previous Uni-Mol Docking model achieved 62%. The predictions are also chemically accurate, avoiding the chirality inversions and steric clashes seen in earlier ML models. When Uni-Mol Docking V2 is combined with more physics-based methods such as Uni-Dock, the authors report further gains in high-quality predictions (RMSD below 1.0 Å and 1.5 Å) and in physical soundness, which makes the approach suitable for industrial applications such as virtual screening and drug design. The code, data, and service are made publicly available to support further research and development.
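To make the headline metric concrete, the following is a minimal sketch of how an RMSD-based success rate could be computed. It is not the authors' evaluation code: the function names `rmsd` and `success_rate` are hypothetical, the coordinates are made up, and the sketch assumes that predicted and reference atoms are listed in the same order and already share the same coordinate frame. Benchmark-grade evaluation typically also accounts for symmetry-equivalent atoms, which is omitted here.

```python
import math

def rmsd(pred, ref):
    """Root-mean-square deviation (in Å) between two lists of (x, y, z) atoms.

    Assumes both lists use the same atom ordering and coordinate frame;
    no alignment or symmetry correction is applied (illustrative only).
    """
    if len(pred) != len(ref) or not pred:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (p - r) ** 2
        for atom_p, atom_r in zip(pred, ref)
        for p, r in zip(atom_p, atom_r)
    )
    return math.sqrt(squared / len(pred))

def success_rate(rmsds, threshold=2.0):
    """Fraction of poses whose RMSD is strictly below the threshold (in Å)."""
    if not rmsds:
        raise ValueError("need at least one RMSD value")
    return sum(r < threshold for r in rmsds) / len(rmsds)

# Made-up example: a two-atom ligand shifted by 1 Å along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
predicted = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
pose_rmsd = rmsd(predicted, reference)

# Made-up RMSD values for four ligands, evaluated at the 2.0 Å cutoff.
rate = success_rate([0.5, 1.0, 2.5, 3.0], threshold=2.0)
```

Passing `threshold=1.0` or `threshold=1.5` to the same helper gives the stricter cutoffs mentioned above. The separate "quality checks" figure concerns physical validity (for example, chirality and steric clashes) and is not captured by RMSD alone.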