Improved Detection of Drug-Induced Liver Injury by Integrating Predicted in vivo and in vitro Data

Srijit Seal, Dominic P Williams, Layla Hosseini-Gerami, Manas Mahale, Anne E. Carpenter, Ola Spjuth, Andreas Bender
DOI: https://doi.org/10.1101/2024.01.10.575128
2024-06-08
Abstract: Drug-induced liver injury (DILI) has been a significant challenge in drug discovery, often leading to clinical trial failures and necessitating drug withdrawals. The existing suite of in vitro proxy-DILI assays is generally effective at identifying compounds with hepatotoxicity. However, there is considerable interest in enhancing in silico prediction of DILI because it allows large sets of compounds to be evaluated more quickly and cost-effectively, particularly in the early stages of projects. In this study, we investigate machine learning (ML) models for DILI prediction that first predict nine proxy-DILI labels and then use them as features, in addition to chemical structural features, to predict DILI. The features include in vitro (e.g., mitochondrial toxicity, bile salt export pump inhibition) data, in vivo (e.g., preclinical rat hepatotoxicity studies) data, pharmacokinetic parameters of maximum concentration, structural fingerprints, and physicochemical parameters. We trained DILI-prediction models on 888 compounds from the DILIst dataset and tested them on a held-out external test set of 223 compounds from the DILIst dataset. The best model, DILIPredictor, attained an AUC-ROC of 0.79. This model enabled the detection of the top 25 toxic compounds compared to models using only structural features (2.68 LR+ score). Using feature interpretation from DILIPredictor, we were able to identify the chemical substructures causing DILI as well as differentiate cases where compounds cause DILI in animals but not in humans. For example, DILIPredictor correctly recognized 2-butoxyethanol as non-toxic in humans despite its hepatotoxicity in mouse models. Overall, the DILIPredictor model improves the detection of compounds causing DILI, with improved differentiation between animal and human sensitivity as well as the potential for mechanism evaluation.
DILIPredictor is publicly available at https://broad.io/DILIPredictor for use via a web interface, with all code available for download and local implementation via https://pypi.org/project/dilipred/.
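The LR+ score quoted in the abstract is a positive likelihood ratio. As a reminder of the standard definition (sensitivity divided by the false positive rate), here is a minimal Python sketch. The function name and the confusion-matrix counts are illustrative assumptions, not values from the paper, and this summary does not describe how the authors apply the metric to the top-ranked compounds.

```python
def positive_likelihood_ratio(tp, fp, tn, fn):
    """Return LR+ = sensitivity / (1 - specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)            # true positive rate
    false_positive_rate = fp / (fp + tn)    # equals 1 - specificity
    if false_positive_rate == 0:
        return float("inf")                 # no false positives at all
    return sensitivity / false_positive_rate


# Made-up counts for illustration only (not from the paper).
example_lr_plus = positive_likelihood_ratio(tp=30, fp=10, tn=50, fn=10)
print(round(example_lr_plus, 2))  # roughly 4.5 for these counts
```

An LR+ above 1 means a positive prediction is more likely for a truly toxic compound than for a non-toxic one, so larger values indicate more informative positive calls.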
Bioinformatics
What problem does this paper attempt to address?
The paper addresses the prediction of drug-induced liver injury (DILI). Existing in vitro and in vivo experiments can effectively identify hepatotoxic compounds, but they are limited in predicting liver toxicity in actual human drug use, especially in the early stages of drug development. Differences in hepatotoxicity between species also make it difficult to infer human safety from animal experiment results. To improve the accuracy of DILI prediction, the authors propose a machine learning model called DILIPredictor. The model first predicts nine proxy-DILI labels, including mitochondrial toxicity and bile salt export pump inhibition, and then uses these predictions as features, combined with chemical structure features and physicochemical parameters, for the final DILI prediction. In this way, the model can detect compounds that may cause DILI more accurately and better distinguish differences in drug sensitivity between animals and humans, providing more reliable support for safety assessment during drug development. Overall, the study aims to improve DILI prediction by integrating multiple data sources, particularly in vitro and in vivo experimental data and chemical structure information, so as to reduce late-stage clinical trial failures caused by unforeseen hepatotoxicity and to enhance the safety and efficiency of drug development.
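The two-stage design described above (predict the proxy labels first, then feed those predictions alongside structural features into the final DILI model) can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' implementation: the fingerprints and labels are random synthetic data, the small logistic regression is a stand-in for whatever classifier the paper uses, and every name (`fit_logistic`, `augment`, `proxy_models`, and so on) is hypothetical. The sketch also assumes the proxy models are trained on compounds separate from the DILI training set.

```python
import math
import random

random.seed(0)   # make the synthetic data reproducible
N_BITS = 16      # toy fingerprint length; real fingerprints are far longer
N_PROXY = 9      # number of proxy-DILI endpoints, as stated in the paper


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(X, y, epochs=50, lr=0.1):
    """Fit a small logistic regression by stochastic gradient descent (stand-in classifier)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            w = [wj - lr * err * xj for wj, xj in zip(w, xi)]
            b -= lr * err
    return w, b


def predict_proba(model, x):
    w, b = model
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))


def random_fingerprint():
    return [random.randint(0, 1) for _ in range(N_BITS)]


# Stage 1: one model per proxy endpoint, trained on (synthetic) proxy data.
proxy_X = [random_fingerprint() for _ in range(60)]
proxy_models = [
    fit_logistic(proxy_X, [random.randint(0, 1) for _ in proxy_X])
    for _ in range(N_PROXY)
]


def augment(fingerprint):
    """Append the predicted proxy probabilities to the structural features."""
    return fingerprint + [predict_proba(m, fingerprint) for m in proxy_models]


# Stage 2: the DILI model sees structural bits plus predicted proxy labels.
dili_X = [random_fingerprint() for _ in range(80)]
dili_y = [random.randint(0, 1) for _ in dili_X]  # random labels, illustration only
dili_model = fit_logistic([augment(fp) for fp in dili_X], dili_y)

query = random_fingerprint()
features = augment(query)
dili_score = predict_proba(dili_model, features)
print(len(features), round(dili_score, 3))
```

Because the labels here are random, the resulting score carries no meaning; the point is only the data flow, in which each compound's feature vector grows from `N_BITS` structural bits to `N_BITS + N_PROXY` values before the final prediction.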