DeepBSP—a Machine Learning Method for Accurate Prediction of Protein–Ligand Docking Structures

Jingxiao Bao, Xiao He, John Z. H. Zhang
DOI: https://doi.org/10.1021/acs.jcim.1c00334
IF: 6.162
2021-05-12
Journal of Chemical Information and Modeling
Abstract: In recent years, machine-learning-based scoring functions have significantly improved scoring power. However, many of these methods perform poorly at distinguishing the native structure from docked decoy poses because their training data lack decoy structural information. Here, we developed a machine-learning model, named DeepBSP, that directly predicts the root-mean-square deviation (RMSD) of a ligand docking pose with reference to its native binding pose. Unlike the binding affinity, the RMSD of a docking pose with reference to its native structure can be determined straightforwardly. Trained on a generated data set of 11,925 native complexes and more than 165,000 docked poses, our model shows excellent docking power on our test set and on the CASF-2016 docking decoy set compared to other major scoring functions. Thus, by combining molecular docking, which generates many poses, with DeepBSP, one can more accurately predict the binding pose closest to the native complex structure. DeepBSP should therefore be very useful for picking out near-native poses from the many poses generated by a docking program.
Supporting Information (available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.1c00334): resolution and ligand heavy atom number distributions of the data set; severe atomic VDW collisions within complex structures; ligand rotatable bond number distributions of the training and test sets; docking power of DeepBSP and of the scoring functions benchmarked in CASF-2016 on its docking decoy set (PDF).
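The quantity the abstract describes, the RMSD of a docked pose relative to the native pose, and its use for choosing among many poses can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy coordinates, and the predicted values are hypothetical, the atoms are assumed to be listed in the same order in both poses, and ligand symmetry is ignored. The 2.0 Å success cutoff is a commonly used convention and is an assumption here, not a value stated in the abstract.

```python
import math

def pose_rmsd(pose, native):
    """RMSD (in the units of the input, e.g. angstroms) between matched atoms.

    Both arguments are lists of (x, y, z) tuples in the same atom order.
    No superposition is applied: both poses are assumed to already sit in
    the same receptor frame.
    """
    if not pose or len(pose) != len(native):
        raise ValueError("pose and native must be non-empty and of equal length")
    squared = sum(
        (a - b) ** 2
        for p, q in zip(pose, native)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared / len(pose))

def pick_best_pose(predicted_rmsds):
    """Index of the pose with the lowest predicted RMSD."""
    if not predicted_rmsds:
        raise ValueError("no poses to rank")
    return min(range(len(predicted_rmsds)), key=predicted_rmsds.__getitem__)

def is_docking_success(true_rmsd, cutoff=2.0):
    """True if the selected pose is within the (assumed) cutoff of the native."""
    return true_rmsd <= cutoff

# Toy example: a two-atom ligand shifted by 1 unit along z.
native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
shifted = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
true_rmsd = pose_rmsd(shifted, native)

# Hypothetical model outputs for three docked poses; a real workflow would
# obtain these from a trained predictor rather than hard-coding them.
predicted = [3.2, 0.8, 5.1]
best = pick_best_pose(predicted)
```

In this sketch, ranking needs only the predicted values, while the true RMSD (which requires the native structure) is used afterwards to judge whether the selected pose counts as a success.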
chemistry, multidisciplinary, medicinal, computer science, interdisciplinary applications, information systems
What problem does this paper attempt to address?