Abstract:Although computational predictions of pharmacokinetics (PK) are desirable at the drug design stage, existing approaches are often limited by prediction accuracy and human interpretability. Using a discovery data set of mouse and rat PK studies at Roche (9,685 unique compounds), we performed a proof-of-concept study to predict key PK properties from chemical structure alone, including plasma clearance (CLp), volume of distribution at steady-state (Vss), and oral bioavailability (F). Ten machine learning (ML) models were evaluated, including Single-Task, Multitask, and transfer learning approaches (i.e., pretraining with in vitro data). In addition to prediction accuracy, we emphasized human interpretability of outcomes, especially the quantification of uncertainty, applicability domains, and explanations of predictions in terms of molecular features. Results show that intravenous (IV) PK properties (CLp and Vss) can be predicted with good precision (average absolute fold error, AAFE of 1.96-2.84 depending on data split) and low bias (average fold error, AFE of 0.98-1.36), with AutoGluon, Gaussian Process Regressor (GP), and ChemProp displaying the best performance. Driven by higher complexity of oral PK studies, predictions of F were more challenging, with the best AAFE values of 2.35-2.60 and higher overprediction bias (AFE of 1.45-1.62). Multi-Task approaches and pretraining of ChemProp neural networks with in vitro data showed similar precision to Single-Task models but helped reduce the bias and increase correlations between observations and predictions. A combination of GP-computed prediction variance, molecular clustering, and dimensionality-reduction provided valuable quantitative insights into prediction uncertainty and applicability domains. SHAPley Additive exPlanations (SHAPs) highlighted molecular features contributing to prediction outcomes of Vss, providing explanations that could aid drug design. Combined results show that computational predictions of PK are feasible at the drug design stage, with several ML technologies converging to successfully leverage historical PK data sets. Further studies are needed to unlock the full potential of this approach, especially with respect to data set sizes and quality, transfer learning between in vitro and in vivo data sets, model-independent quantification of uncertainty, and explainability of predictions.

Evaluation of the Success of High-Throughput Physiologically Based Pharmacokinetic (HT-PBPK) Modeling Predictions to Inform Early Drug Discovery

Application of Ivive and Pbpk Modeling in Prospective Prediction of Clinical Pharmacokinetics: Strategy and Approach During the Drug Discovery Phase with Four Case Studies

Systematic Evaluation of High-Throughput PBK Modelling Strategies for the Prediction of Intravenous and Oral Pharmacokinetics in Humans

Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage

Application of Physiologically Based Pharmacokinetic Modeling in Preclinical Studies: A Feasible Strategy to Practice the Principles of 3Rs

PHRMA CPCDC initiative on predictive models of human pharmacokinetics, part 5: Prediction of plasma concentration–time profiles in human by using the physiologically‐based pharmacokinetic modeling approach

Model-based Target Pharmacology Assessment (mTPA): An Approach Using PBPK/PD Modeling and Machine Learning to Design Medicinal Chemistry and DMPK Strategies in Early Drug Discovery

Shared learning from a physiologically based pharmacokinetic modeling strategy for human pharmacokinetics prediction through retrospective analysis of Genentech compounds

Can We Predict Clinical Pharmacokinetics of Highly Lipophilic Compounds by Integration of Machine Learning or In Vitro Data into Physiologically Based Models? A Feasibility Study Based on 12 Development Compounds

A Combination of Machine Learning and PBPK Modeling Approach for Pharmacokinetics Prediction of Small Molecules in Humans

Evaluation of Generic Methods to Predict Human Pharmacokinetics Using Physiologically Based Pharmacokinetic Model for Early Drug Discovery of Tyrosine Kinase Inhibitors

Building a Predictive PBPK Model for Human OATP Substrates: a Strategic Framework for Early Evaluation of Clinical Pharmacokinetic Variations Using Pitavastatin as an Example

PBPK modeling and simulation in drug research and development

Large scale compartmental model-based study of preclinical pharmacokinetic data and its impact on compound triaging in drug discovery

Predicting pharmacodynamic effects through early drug discovery with artificial intelligence-physiologically based pharmacokinetic (AI-PBPK) modelling

Exploring the Impact of Pharmacological Target-Mediated Low Plasma Exposure in Lead Compound Selection in Drug Discovery – A Modeling Approach

Estimating Organ-to-Plasma Ratios in physiologically based PK modeling: a simplified approach for early drug discovery

Overcoming the shortcomings of the extended-clearance concept: a framework for developing a physiologically-based pharmacokinetic (PBPK) model to select drug candidates involving transporter-mediated clearance

Discovery Phase Agrochemical Predictive Safety Assessment Using High Content In Vitro Data to Estimate an In Vivo Toxicity Point of Departure

Physiologically Based Pharmacokinetic Modelling for First-In-Human Predictions: An Updated Model Building Strategy Illustrated with Challenging Industry Case Studies