Abstract: As one of the most popular computational approaches in modern structure-based drug design, molecular docking can be used not only to identify the correct conformation of a ligand within the target binding pocket but also to estimate the strength of the interaction between a target and a ligand. Because a variety of docking programs are now available to the scientific community, a comprehensive understanding of the advantages and limitations of each program is essential for conducting more reasonable docking studies and docking-based virtual screening. In the present study, based on an extensive dataset of 2002 protein-ligand complexes from the PDBbind database (version 2014), the performance of ten docking programs, including five commercial programs (LigandFit, Glide, GOLD, MOE Dock, and Surflex-Dock) and five academic programs (AutoDock, AutoDock Vina, LeDock, rDock, and UCSF DOCK), was systematically evaluated by examining the accuracy of binding pose prediction (sampling power) and binding affinity estimation (scoring power). Our results showed that GOLD and LeDock had the best sampling power (GOLD: 59.8% accuracy for the top scored poses; LeDock: 80.8% accuracy for the best poses) and that AutoDock Vina had the best scoring power (Pearson/Spearman correlation coefficients r(p)/r(s) of 0.564/0.580 for the top scored poses and 0.569/0.584 for the best poses), suggesting that the commercial programs did not outperform the academic ones as might have been expected. Overall, the evaluated docking programs could identify the ligand binding poses in most cases, but most of them could not reliably rank the binding affinities across the entire dataset. However, for some protein families, relatively high linear correlations between docking scores and experimental binding affinities could be achieved. To our knowledge, this study is the most extensive evaluation of popular molecular docking programs in the last five years. We expect that our work can offer useful information for the successful application of these docking tools to different requirements and targets.
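The two metrics named in the abstract can be illustrated with a minimal Python sketch. It rests on assumptions that the abstract does not state: a pose counts as correct when its RMSD to the crystal pose is at most 2.0 Å (a common convention), poses are listed in order of docking score with the top scored pose first, and predicted scores are oriented so that higher means stronger predicted binding. The function names and toy numbers are hypothetical and are not taken from the study.

```python
from math import sqrt

RMSD_CUTOFF = 2.0  # Angstrom; assumed success threshold, not given in the abstract


def success_rates(rmsds_per_complex, cutoff=RMSD_CUTOFF):
    """Return (top-scored, best-pose) success rates for a set of complexes.

    Each inner list holds the pose RMSDs of one complex, ordered by docking
    score with the top scored pose first (an assumption of this sketch).
    """
    n = len(rmsds_per_complex)
    top = sum(1 for poses in rmsds_per_complex if poses[0] <= cutoff)
    best = sum(1 for poses in rmsds_per_complex if min(poses) <= cutoff)
    return top / n, best / n


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Toy data (hypothetical): four complexes with two poses each.
toy_rmsds = [[1.2, 3.0], [4.5, 1.8], [5.0, 6.1], [0.9, 0.7]]
toy_scores = [9.1, 7.4, 6.0, 8.2]      # predicted binding strength
toy_affinity = [8.3, 6.1, 5.2, 6.9]    # experimental affinity, e.g. pKd

top_rate, best_rate = success_rates(toy_rmsds)
print(top_rate, best_rate)
print(pearson(toy_scores, toy_affinity), spearman(toy_scores, toy_affinity))
```

The best-pose rate can never be lower than the top-scored rate, because the top scored pose is itself among the candidates. This matches the abstract's pattern of a higher figure for best poses than for top scored poses.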

Comprehensive evaluation of ten docking programs on a diverse set of protein-ligand complexes: the prediction accuracy of sampling power and scoring power.
