Enhancing the Small-Scale Screenable Biological Space beyond Known Chemogenomics Libraries with Gray Chemical Matter─Compounds with Novel Mechanisms from High-Throughput Screening Profiles

Jason R Thomas,Claude Shelton 4th,Jason Murphy,Scott Brittain,Mark-Anthony Bray,Peter Aspesi,John Concannon,Frederick J King,Robert J Ihry,Daniel J Ho,Martin Henault,Andrea Hadjikyriacou,Marilisa Neri,Frederic D Sigoillot,Helen T Pham,Matthew Shum,Louise Barys,Michael D Jones,Eric J Martin,Anke Blechschmidt,Sébastien Rieffel,Thomas J Troxler,Felipa A Mapa,Jeremy L Jenkins,Rishi K Jain,Peter S Kutchukian,Markus Schirle,Steffen Renner
DOI: https://doi.org/10.1021/acschembio.3c00737
2024-04-19
Abstract:Phenotypic assays have become an established approach to drug discovery. Greater disease relevance is often achieved through cellular models with increased complexity and more detailed readouts, such as gene expression or advanced imaging. However, the intricate nature and cost of these assays impose limitations on their screening capacity, often restricting screens to well-characterized small compound sets such as chemogenomics libraries. Here, we outline a cheminformatics approach to identify a small set of compounds with likely novel mechanisms of action (MoAs), expanding the MoA search space for throughput limited phenotypic assays. Our approach is based on mining existing large-scale, phenotypic high-throughput screening (HTS) data. It enables the identification of chemotypes that exhibit selectivity across multiple cell-based assays, which are characterized by persistent and broad structure activity relationships (SAR). We validate the effectiveness of our approach in broad cellular profiling assays (Cell Painting, DRUG-seq, and Promotor Signature Profiling) and chemical proteomics experiments. These experiments revealed that the compounds behave similarly to known chemogenetic libraries, but with a notable bias toward novel protein targets. To foster collaboration and advance research in this area, we have curated a public set of such compounds based on the PubChem BioAssay dataset and made it available for use by the scientific community.
What problem does this paper attempt to address?