FRAGSITE: A Fragment-Based Approach for Virtual Ligand Screening
Hongyi Zhou,Hongnan Cao,Jeffrey Skolnick
DOI: https://doi.org/10.1021/acs.jcim.0c01160
IF: 6.162
2021-03-16
Journal of Chemical Information and Modeling
Abstract:To reduce time and cost, virtual ligand screening (VLS) often precedes experimental ligand screening in modern drug discovery. Traditionally, high-resolution structure-based docking approaches rely on experimental structures, while ligand-based approaches need known binders to the target protein and only explore their nearby chemical space. In contrast, our structure-based FINDSITE<sup>comb2.0</sup> approach takes advantage of predicted, low-resolution structures and information from ligands that bind distantly related proteins whose binding sites are similar to the target protein. Using a boosted tree regression machine learning framework, we significantly improved FINDSITE<sup>comb2.0</sup> by integrating ligand fragment scores as encoded by molecular fingerprints with the global ligand similarity scores of FINDSITE<sup>comb2.0</sup>. The new approach, FRAGSITE, exploits our observation that ligand fragments, e.g., rings, tend to interact with stereochemically conserved protein subpockets that also occur in evolutionarily unrelated proteins. FRAGSITE was benchmarked on the 102 protein DUD-E set, where any template protein whose sequence identify >30% to the target was excluded. Within the top 100 ranked molecules, FRAGSITE improves VLS precision and recall by 14.3 and 18.5%, respectively, relative to FINDSITE<sup>comb2.0</sup>. Moreover, the mean top 1% enrichment factor increases from 25.2 to 30.2. On average, both outperform state-of-the-art deep learning-based methods such as AtomNet. On the more challenging unbiased set LIT-PCBA, FRAGSITE also shows better performance than ligand similarity-based and docking approaches such as two-dimensional ECFP4 and Surflex-Dock v.3066. On a subset of 23 targets from DEKOIS 2.0, FRAGSITE shows much better performance than the boosted tree regression-based, vScreenML scoring function. Experimental testing of FRAGSITE's predictions shows that it has more hits and covers a more diverse region of chemical space than FINDSITE<sup>comb2.0</sup>. For the two proteins that were experimentally tested, DHFR, a well-studied protein that catalyzes the conversion of dihydrofolate to tetrahydrofolate, and the kinase ACVR1, FRAGSITE identified new small-molecule nanomolar binders. Interestingly, one new binder of DHFR is a kinase inhibitor predicted to bind in a new subpocket. For ACVR1, FRAGSITE identified new molecules that have diverse scaffolds and estimated nanomolar to micromolar affinities. Thus, FRAGSITE shows significant improvement over prior state-of-the-art ligand virtual screening approaches. A web server is freely available for academic users at <a class="extLink" href="http://http:/sites.gatech.edu/cssb/FRAGSITE">http:/sites.gatech.edu/cssb/FRAGSITE</a>.The Supporting Information is available free of charge at <a class="ext-link" href="/doi/10.1021/acs.jcim.0c01160?goto=supporting-info">https://pubs.acs.org/doi/10.1021/acs.jcim.0c01160</a>.Additional information for experimental methods and (Tables S1 and S2) top 40 most frequent fragments in DrugBank drugs and performance of methods for individual DUD-E targets using the modeled target structure (<a class="ext-link" href="/doi/suppl/10.1021/acs.jcim.0c01160/suppl_file/ci0c01160_si_001.pdf">PDF</a>)This article has not yet been cited by other publications.
chemistry, multidisciplinary, medicinal,computer science, interdisciplinary applications, information systems