Abstract: The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein-ligand docking have recently been introduced, to date no prior works have systematically studied the behavior of docking methods within the broadly applicable context of (1) using predicted (apo) protein structures for docking (e.g., for applicability to unknown structures); (2) docking multiple ligands concurrently to a given target protein (e.g., for enzyme design); and (3) having no prior knowledge of binding pockets (e.g., for unknown pocket generalization). To enable a deeper understanding of docking methods' real-world utility, we introduce PoseBench, the first comprehensive benchmark for broadly applicable protein-ligand docking. PoseBench enables researchers to rigorously and systematically evaluate DL docking methods for apo-to-holo protein-ligand docking and protein-ligand structure generation using both single and multi-ligand benchmark datasets, the latter of which we introduce for the first time to the DL community. Empirically, using PoseBench, we find that (1) DL methods consistently outperform conventional docking algorithms; (2) most recent DL docking methods fail to generalize to multi-ligand protein targets; and (3) training DL methods with physics-informed loss functions on diverse clusters of protein-ligand complexes is a promising direction for future work. Code, data, tutorials, and benchmark results are available at https://github.com/BioinfoMachineLearning/PoseBench.

What problem does this paper attempt to address?

This paper addresses three key challenges in protein-ligand docking:

1. **Docking with predicted protein structures**: In many practical applications, especially for proteins whose structures are unknown, researchers need to dock ligands to predicted protein structures (for example, structures predicted by AlphaFold 3). The docking method must therefore predict the ligand's binding position and conformation accurately even when no binding pocket is known.
2. **Simultaneous docking of multiple ligands**: Many biomolecular processes involve multiple ligands interacting with the same target protein, which matters for applications such as enzyme design. Docking methods that can handle several ligands concurrently are therefore an important research direction.
3. **Binding pocket identification without prior knowledge**: Researchers often do not know where the binding pocket lies on the protein, so the docking method must locate it accurately on its own.

To evaluate existing methods systematically on these three aspects, the authors introduce **PoseBench**, a comprehensive benchmark for deep learning (DL) methods in protein-ligand docking. PoseBench contributes the following:

- **Datasets**: PoseBench provides both single-ligand and multi-ligand datasets covering different types of protein-ligand complexes.
- **Task definitions**: PoseBench defines two tasks, single-ligand blind docking and multi-ligand blind docking, to evaluate methods in different scenarios.
- **Evaluation metrics**: PoseBench scores docking results with multiple structural accuracy and molecular validity metrics, so that a predicted pose is judged both on how close it is to the reference and on whether it is physically plausible.
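As a rough illustration of what a structural accuracy metric can look like, the sketch below computes a plain coordinate RMSD between a predicted and a reference ligand pose and the fraction of poses under a cutoff. It is an assumption-laden toy, not PoseBench's implementation: the function names are hypothetical, the 2 Å cutoff is a common docking convention rather than a value stated above, and the code assumes both poses list the same atoms in the same order with no symmetry correction or structural alignment.

```python
import math


def ligand_rmsd(pred, ref):
    """RMSD (in angstroms) between two equally ordered lists of (x, y, z) atoms."""
    if len(pred) != len(ref) or not pred:
        raise ValueError("pred and ref must be non-empty and the same length")
    # Sum squared differences over every coordinate of every atom pair.
    squared = sum(
        (p - r) ** 2
        for atom_pred, atom_ref in zip(pred, ref)
        for p, r in zip(atom_pred, atom_ref)
    )
    return math.sqrt(squared / len(pred))


def success_rate(rmsds, threshold=2.0):
    """Fraction of poses with RMSD at or below `threshold` (assumed cutoff)."""
    return sum(r <= threshold for r in rmsds) / len(rmsds)


# Toy reference ligand with three atoms (hypothetical coordinates).
ref = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
# One pose shifted by 1 angstrom along x, another shifted by 3 angstroms.
pred_shifted = [(x + 1.0, y, z) for x, y, z in ref]
pred_far = [(x + 3.0, y, z) for x, y, z in ref]

rmsds = [ligand_rmsd(pred_shifted, ref), ligand_rmsd(pred_far, ref)]
print(rmsds, success_rate(rmsds))
```

Here the first pose counts as a success under the assumed cutoff and the second does not. A real evaluation would also need to handle symmetric atoms and to pair such a distance metric with molecular validity checks.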
Through these systematic evaluations, the authors found that:

1. **DL methods outperform conventional methods**: DL docking methods consistently perform better than conventional docking algorithms.
2. **Multi-ligand docking remains a challenge**: Most recent DL docking methods fail to generalize to multi-ligand protein targets, indicating that further research is needed in this area.
3. **Physics-informed loss functions matter**: Training DL models with physics-informed loss functions (such as a van der Waals clash loss) on diverse clusters of protein-ligand complexes appears to improve generalization, especially in multi-ligand docking, and is a promising direction for future work.

In conclusion, the paper uses the PoseBench benchmark to evaluate DL methods for protein-ligand docking systematically and points out important directions for future research.
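To make the idea of a physics-informed loss term concrete, here is a minimal sketch of a van der Waals clash penalty between ligand and protein atoms. It is illustrative only and is not the loss used by any method evaluated in the paper: the function name, the squared-overlap form, and the `tolerance` parameter are assumptions, and the 1.7 Å radius is the commonly tabulated van der Waals radius of carbon.

```python
import math


def clash_penalty(ligand_atoms, protein_atoms, tolerance=0.0):
    """Sum of squared van der Waals overlaps between ligand and protein atoms.

    Each atom is a tuple (x, y, z, radius) in angstroms. A pair contributes
    only when its distance is shorter than the sum of the two radii minus
    `tolerance` (a hypothetical slack parameter).
    """
    total = 0.0
    for lx, ly, lz, l_radius in ligand_atoms:
        for px, py, pz, p_radius in protein_atoms:
            distance = math.dist((lx, ly, lz), (px, py, pz))
            overlap = (l_radius + p_radius - tolerance) - distance
            if overlap > 0.0:
                total += overlap ** 2
    return total


CARBON_RADIUS = 1.7  # angstroms, illustrative value

protein = [(0.0, 0.0, 0.0, CARBON_RADIUS)]
clashing_ligand = [(3.0, 0.0, 0.0, CARBON_RADIUS)]   # 0.4 angstrom overlap
separated_ligand = [(4.0, 0.0, 0.0, CARBON_RADIUS)]  # no overlap

print(clash_penalty(clashing_ligand, protein))
print(clash_penalty(separated_ligand, protein))
```

The penalty is zero for well-separated atoms and grows smoothly as atoms interpenetrate, which is the property that makes a term of this kind usable as a training signal. In a multi-ligand setting, the same pairwise term could in principle also be applied between the atoms of different ligands.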

Deep Learning for Protein-Ligand Docking: Are We There Yet?
