A Mechanism to Open Academic Chemistry to High-Throughput Virtual Screening

Corentin Bedart, Grace Shimokura, Frederick G West, Tabitha E Wood, Robert A Batey, John J Irwin, Matthieu Schapira
DOI: https://doi.org/10.26434/chemrxiv-2023-jgbgv
2023-12-08
Abstract: Computationally screening chemical libraries to discover molecules with desired properties is a common technique used in early-stage drug discovery. Recent progress in the field now enables the efficient exploration of billions of molecules within days or hours, but this exploration remains confined within the boundaries of the accessible chemistry space. While the number of commercially available compounds grows rapidly, it remains a limited subset of molecules that could be synthesized. Here, we present a workflow where chemical reactions typically developed in academia and unconventional in drug discovery are exploited to dramatically expand the chemistry space accessible to virtual screening. We use this process to generate a first version of the Pan-Canadian Chemical Library, a collection of nearly 150 billion diverse compounds that does not overlap with other ultra-large libraries such as Enamine REAL or SAVI and could be a resource of choice for protein targets where other libraries have failed to deliver bioactive molecules. A 127 million compound subset of the library is available at https://pccl.thesgc.org/.
Chemistry
What problem does this paper attempt to address?
This paper presents a workflow for expanding the chemical space accessible to virtual screening in drug discovery. Virtual screening is currently limited mainly to commercially available compounds. The workflow instead exploits chemical reactions developed in academic laboratories, which are unconventional in drug discovery, to generate a first version of the Pan-Canadian Chemical Library (PCCL), a collection of nearly 150 billion compounds. The library does not overlap with other ultra-large libraries such as Enamine REAL or SAVI, and it includes a subset of roughly 127 million low-cost molecules with drug-like properties. The researchers selected reactions from academic laboratories at the University of Toronto, the University of Winnipeg, and the University of Alberta, including β-ketoimine, 5-aminotriazole, and 5-aminotetrazole syntheses, the Truce-Smiles rearrangement, and [2+2] and [4+2] cycloadditions, and generated compounds by combining each reaction with compatible commercial reagents. These reactions yield a large number of chemical structures previously unexplored in drug discovery. The PCCL aims to push past the boundaries of existing chemical space by introducing novel reactions, and to provide bioactive molecules for protein targets, especially those for which other libraries have failed to deliver active compounds. The resource could help expand the frontiers of chemical exploration and support the development of precision medicine. The paper also describes the chemical reactions in detail, along with the reactant selection and filtering rules and statistics on the generated compound collection.
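The idea of combining each reaction with compatible commercial reagents can be illustrated with a small combinatorial sketch. This is not the authors' pipeline: the reagent names, functional-group labels, prices, reaction templates, and the `max_price` cutoff below are all hypothetical placeholders. The sketch only shows why a handful of reactions applied to large reagent pools multiplies into a very large virtual library.

```python
from itertools import product
from math import prod

# Toy reagent catalogue (hypothetical): name -> (functional groups, price in USD).
REAGENTS = {
    "R1": ({"group_a"}, 20.0),
    "R2": ({"group_a"}, 500.0),
    "R3": ({"group_b"}, 35.0),
    "R4": ({"group_b"}, 15.0),
    "R5": ({"group_a", "group_c"}, 40.0),
    "R6": ({"group_c"}, 60.0),
}

# Toy reaction templates (hypothetical): name -> functional group required per slot.
REACTIONS = {
    "reaction_X": ("group_a", "group_b"),
    "reaction_Y": ("group_b", "group_c"),
}


def compatible(group, reagents, max_price):
    """Return the sorted names of reagents that carry `group` and cost at most `max_price`."""
    return sorted(
        name
        for name, (groups, price) in reagents.items()
        if group in groups and price <= max_price
    )


def enumerate_products(reactions, reagents, max_price):
    """Yield one (reaction, reagent, reagent, ...) tuple per virtual product."""
    for rxn, slots in reactions.items():
        pools = [compatible(group, reagents, max_price) for group in slots]
        for combo in product(*pools):
            yield (rxn, *combo)


def library_size(reactions, reagents, max_price):
    """Count virtual products without enumerating them (product of pool sizes per reaction)."""
    return sum(
        prod(len(compatible(group, reagents, max_price)) for group in slots)
        for slots in reactions.values()
    )


if __name__ == "__main__":
    for entry in enumerate_products(REACTIONS, REAGENTS, max_price=100.0):
        print(entry)
    print("library size:", library_size(REACTIONS, REAGENTS, max_price=100.0))
```

Counting through `library_size` rather than materializing every tuple matters at scale: pool sizes in the thousands per slot already give millions of products per two-component reaction, so a real enumeration would need chemistry-aware filtering and streaming output rather than an in-memory list.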