Deep Learning for Protein-Ligand Docking: Are We There Yet?

Alex Morehead,Nabin Giri,Jian Liu,Jianlin Cheng
2024-10-01
Abstract:The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein-ligand docking have recently been introduced, to date no prior works have systematically studied the behavior of docking methods within the broadly applicable context of (1) using predicted (apo) protein structures for docking (e.g., for applicability to unknown structures); (2) docking multiple ligands concurrently to a given target protein (e.g., for enzyme design); and (3) having no prior knowledge of binding pockets (e.g., for unknown pocket generalization). To enable a deeper understanding of docking methods' real-world utility, we introduce PoseBench, the first comprehensive benchmark for broadly applicable protein-ligand docking. PoseBench enables researchers to rigorously and systematically evaluate DL docking methods for apo-to-holo protein-ligand docking and protein-ligand structure generation using both single and multi-ligand benchmark datasets, the latter of which we introduce for the first time to the DL community. Empirically, using PoseBench, we find that (1) DL methods consistently outperform conventional docking algorithms; (2) most recent DL docking methods fail to generalize to multi-ligand protein targets; and (3) training DL methods with physics-informed loss functions on diverse clusters of protein-ligand complexes is a promising direction for future work. Code, data, tutorials, and benchmark results are available at <a class="link-external link-https" href="https://github.com/BioinfoMachineLearning/PoseBench" rel="external noopener nofollow">this https URL</a>.
Machine Learning,Artificial Intelligence,Biomolecules,Quantitative Methods
What problem does this paper attempt to address?
The problems that this paper attempts to solve are several key challenges in protein - ligand docking, specifically including: 1. **Docking using predicted protein structures**: In many practical applications, especially for proteins with unknown structures, researchers need to be able to use predicted protein structures (for example, structures predicted by AlphaFold 3) to perform protein - ligand docking. This requires that the docking method can accurately predict the binding position and conformation of the ligand without a known binding pocket. 2. **Simultaneous docking of multiple ligands**: Many biomolecular processes involve the interaction of multiple ligands with the same target protein, such as enzyme design. Therefore, developing a docking method that can handle multiple ligands simultaneously is an important research direction. 3. **Binding pocket identification without prior knowledge**: In many cases, researchers may not know the location of the binding pocket on the protein. Therefore, the docking method needs to be able to accurately identify these binding pockets without prior knowledge. In order to systematically evaluate the performance of existing methods in these three aspects, the author introduced **POSEBENCH**, which is a comprehensive benchmarking platform for evaluating the performance of deep learning (DL) methods in protein - ligand docking. POSEBENCH includes the following innovations: - **Data sets**: POSEBENCH provides a variety of data sets, including single - ligand and multi - ligand data sets, which cover different types of protein - ligand complexes. - **Task definitions**: POSEBENCH defines two tasks, single - ligand blind docking and multi - ligand blind docking, to evaluate the performance of methods in different scenarios. - **Evaluation metrics**: POSEBENCH uses multiple structural accuracy and molecular validity metrics to evaluate the docking results, ensuring comprehensiveness and accuracy of the evaluation. Through these systematic evaluations, the author found that: 1. **DL methods are superior to traditional methods**: In most cases, the performance of deep learning methods is better than that of traditional docking algorithms. 2. **Challenges in multi - ligand docking**: Most of the existing deep learning docking methods perform poorly in multi - ligand docking tasks, indicating that further research is still needed in this area. 3. **The importance of physics - informed loss functions**: When training deep learning models, using physics - informed loss functions (such as van der Waals collision loss) can significantly improve the generalization ability of the model, especially in multi - ligand docking tasks. In conclusion, this paper systematically evaluated the performance of deep learning methods in protein - ligand docking by introducing the POSEBENCH benchmarking platform and pointed out important directions for future research.