Abstract:Successful drug discovery projects require control and optimization of compound properties related to pharmacokinetics, pharmacodynamics, and safety. While volume and chemotype coverage of public and corporate ADME-Tox (absorption, distribution, excretion, metabolism, and toxicity) databases are constantly growing, deep neural nets (DNN) emerged as transformative artificial intelligence technology to analyze those challenging data. Relevant features are automatically identified, while appropriate data can also be combined to multitask networks to evaluate hidden trends among multiple ADME-Tox parameters for implicitly correlated data sets. Here we describe a novel, fully industrialized approach to parametrize and optimize the setup, training, application, and visual interpretation of DNNs to model ADME-Tox data. Investigated properties include microsomal lability in different species, passive permeability in Caco-2/TC7 cells, and logD. Statistical models are developed using up to 50 000 compounds from public or corporate databases. Both the choice of DNN hyperparameters and the type and quantity of molecular descriptors were found to be important for successful DNN modeling. Alternate learning of multiple ADME-Tox properties, resulting in a multitask approach, performs statistically superior on most studied data sets in comparison to DNN single-task models and also provides a scalable method to predict ADME-Tox properties from heterogeneous data. For example, predictive quality using external validation sets was improved from R<sup>2</sup> of 0.6 to 0.7 comparing single-task and multitask DNN networks from human metabolic lability data. Besides statistical evaluation, a new visualization approach is introduced to interpret DNN models termed "response map", which is useful to detect local property gradients based on structure fragmentation and derivatization. This method is successfully applied to visualize fragmental contributions to guide further design in drug discovery programs, as illustrated by CRCX3 antagonists and renin inhibitors, respectively.

Efficient Toxicity Prediction via Simple Features Using Shallow Neural Networks and Decision Trees

Toxicity Prediction using Deep Learning

A deep learning based multi-model approach for predicting drug-like chemical compound's toxicity

Deep Learning Based Regression and Multiclass Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction.

Deep learning for predicting toxicity of chemicals: a mini review

Explainable AI and tree-based ensemble models: a comparative study in predicting chemical pulmonary toxicity

Co‐Model for Chemical Toxicity Prediction Based on Multi‐task Deep Learning

Deep Learning Based Regression and Multi-class Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction

Explaining Chemical Toxicity using Missing Features

Novel Approach of Deep Learning in Toxicity Prediction

Multitask CapsNet: an Imbalanced Data Deep Learning Method for Predicting Toxicants

Toxicity Detection in Drug Candidates using Simplified Molecular-Input Line-Entry System

Accurate Clinical Toxicity Prediction using Multi-task Deep Neural Nets and Contrastive Molecular Explanations

Deep-learning: investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data

Rapid Prediction of Chemical Ecotoxicity Through Genetic Algorithm Optimized Neural Network Models

Drug Toxicity Prediction by Machine Learning Approaches

TOP: Towards Better Toxicity Prediction by Deep Molecular Representation Learning

Predictive Multitask Deep Neural Network Models for ADME-Tox Properties: Learning from Large Data Sets

deepFPlearn +: enhancing toxicity prediction across the chemical universe using graph neural networks

Artificial Intelligence-Based Toxicity Prediction of Environmental Chemicals: Future Directions for Chemical Management Applications.

Step Change Improvement in ADMET Prediction with PotentialNet Deep Featurization