VAD-MM/GBSA: A Variable Atomic Dielectric MM/GBSA Model for Improved Accuracy in Protein–Ligand Binding Free Energy Calculations
Ercheng Wang, Weitao Fu, Dejun Jiang, Huiyong Sun, Junmei Wang, Xujun Zhang, Gaoqi Weng, Hui Liu, Peng Tao, Tingjun Hou
DOI: https://doi.org/10.1021/acs.jcim.1c00091
IF: 6.162
2021-05-20
Journal of Chemical Information and Modeling
Abstract: The molecular mechanics/generalized Born surface area (MM/GBSA) method has been widely used for end-point binding free energy prediction in structure-based drug design (SBDD). In practice, however, it is usually regarded as a disputed method, mostly because of its system dependence. Here, by combining MM/GBSA with machine-learning optimization, we developed a novel version of the method, named variable atomic dielectric MM/GBSA (VAD-MM/GBSA), which assigns variable dielectric constants directly to the protein/ligand atoms. The new strategy exhibits markedly improved accuracy in binding affinity calculations for various protein–ligand systems and is promising for use in the postprocessing of structure-based virtual screening. Moreover, VAD-MM/GBSA outperformed Prime MM/GBSA in the Schrödinger software and showed remarkable predictive performance for specific protein targets, such as POL polyprotein and human immunodeficiency virus type 1 (HIV-1) protease. Our study showed that the VAD-MM/GBSA method, with little extra computational overhead, provides a potential replacement for the MM/GBSA in the AMBER software.
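For readers unfamiliar with the end-point scheme the abstract refers to, the following is a minimal sketch of the standard MM/GBSA bookkeeping: the binding free energy is estimated as the energy of the complex minus the energies of the receptor and the ligand, averaged over snapshots. The class and function names are hypothetical and do not correspond to the AMBER or Schrödinger interfaces. The entropy term is omitted, and the per-atom dielectric assignment of VAD-MM/GBSA is not modeled because the abstract does not specify it.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Iterable, Tuple


@dataclass(frozen=True)
class EnergyTerms:
    """Energy terms of one species in one snapshot, in kcal/mol.

    Illustrative container only; not an AMBER or Schrodinger data structure.
    """
    vdw: float  # van der Waals energy (molecular mechanics)
    ele: float  # Coulomb electrostatic energy (molecular mechanics)
    gb: float   # polar solvation energy (generalized Born)
    sa: float   # nonpolar solvation energy (surface-area term)

    def total(self) -> float:
        # Sum of gas-phase and solvation contributions; entropy is omitted.
        return self.vdw + self.ele + self.gb + self.sa


Snapshot = Tuple[EnergyTerms, EnergyTerms, EnergyTerms]  # complex, receptor, ligand


def binding_energy(complex_: EnergyTerms,
                   receptor: EnergyTerms,
                   ligand: EnergyTerms) -> float:
    """End-point estimate for one snapshot: G(complex) - G(receptor) - G(ligand)."""
    return complex_.total() - receptor.total() - ligand.total()


def mean_binding_energy(snapshots: Iterable[Snapshot]) -> float:
    """Average the per-snapshot end-point estimate over all snapshots."""
    return mean(binding_energy(c, r, l) for c, r, l in snapshots)
```

Calling `mean_binding_energy` on a list of `(complex, receptor, ligand)` tuples returns the averaged estimate in kcal/mol. In a variable-dielectric scheme the electrostatic and polar solvation terms would be computed with modified dielectric constants before they reach this bookkeeping step.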
An online web server for VAD-MMGBSA has been developed and is available at http://cadd.zju.edu.cn/vdgb. The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.1c00091. It contains: the training set and test sets 1, 2, and 3 used in this work (Tables S1–S4, respectively); descriptors of ligands and pockets used in XGBoost regression, with feature importance (Table S5); the distributions of descriptors (feature importance >0.03) of ligands and pockets for the complexes in the training set (Table S6); the values of descriptors (feature importance >0.03) of ligands and pockets for the eight protein-target families described in Figure 2 (Tables S7–S14); the mean dielectric values obtained with the VAD model (Tables S15 and S16); the search space in annealing-based optimization (Figure S1); the relationship between the two coefficients a and b and the ligand volumes (Figure S2); residual plots and binding free energy prediction comparison using VAD-MM/GBSA and VD114-MM/GBSA for the eight protein-target families described in Figure 2 (Figure S3) (PDF).
chemistry, multidisciplinary, medicinal; computer science, interdisciplinary applications, information systems