Abstract: Since 2009, the Tox21 project has screened ∼8500 chemicals in more than 70 high-throughput assays, generating upward of 100 million data points, with all data publicly available through partner websites at the United States Environmental Protection Agency (EPA), National Center for Advancing Translational Sciences (NCATS), and National Toxicology Program (NTP). Underpinning this public effort is the largest compound library ever constructed specifically for improving understanding of the chemical basis of toxicity across research and regulatory domains. Each Tox21 federal partner brought specialized resources and capabilities to the partnership, including three approximately equal-sized compound libraries. All Tox21 data generated to date have resulted from a confluence of ideas, technologies, and expertise used to design, screen, and analyze the Tox21 10K library. The different programmatic objectives of the partners led to three distinct, overlapping compound libraries that, when combined, not only covered a diversity of chemical structures, use-categories, and properties but also incorporated many types of compound replicates. The history of development of the Tox21 "10K" chemical library and data workflows implemented to ensure quality chemical annotations and allow for various reproducibility assessments are described. Cheminformatics profiling demonstrates how the three partner libraries complement one another to expand the reach of each individual library, as reflected in coverage of regulatory lists, predicted toxicity end points, and physicochemical properties. ToxPrint chemotypes (CTs) and enrichment approaches further demonstrate how the combined partner libraries amplify structure–activity patterns that would otherwise not be detected.
Finally, CT enrichments are used to probe global patterns of activity in combined ToxCast and Tox21 activity data sets relative to test-set size and chemical versus biological end point diversity, illustrating the power of CT approaches to discern patterns in chemical–activity data sets. These results support a central premise of the Tox21 program: A collaborative merging of programmatically distinct compound libraries would yield greater rewards than could be achieved separately.
