Abstract: In silico methods are essential to the safety evaluation of chemicals. Computational risk assessment offers several approaches, with data science and knowledge-based methods becoming an increasingly important sub-group. A key strength of data science is that it uses existing data to find correlations, build strong hypotheses, and create new knowledge that may help reduce the number of resource-intensive experiments. In choosing a suitable method for toxicity prediction, the available data and the desired toxicity endpoint are two essential factors to consider. The complexity of the endpoint can affect the success rate of in silico models. For highly complex endpoints such as hepatotoxicity, it can be beneficial to decipher the toxic event from a more systemic point of view. We propose a data science-based modelling pipeline that uses compounds' connections to tissue-specific biological targets, the interactome, and biological pathways as descriptors of compounds. Models trained on different combinations of the collected compound-target, compound-interactor, and compound-pathway profiles were used to predict the hepatotoxicity of drug-like compounds. Several tree-based models were trained, using separate and combined target-, interactome-, and pathway-level variables. The model using the combined descriptors of all levels and the random forest algorithm was further optimized. Descriptor importance was assessed and examined for biological explanations, to identify which targets or pathways may play a crucial role in toxicity. Descriptors connected to cytochrome P450 enzymes, heme degradation, and biological oxidation received high weights. Furthermore, other processes less often discussed in connection with toxicity, such as the involvement of RHO GTPase effectors in hepatotoxicity, were marked as fundamental.
The optimized combined model using only the selected descriptors yielded the best performance, with an accuracy of 0.766. Models trained on the same dataset with classical Morgan fingerprints as the compound representation, and with Morgan fingerprints combined with the systems biology-based descriptors, showed similar performance measures. Consequently, adding the structural information of compounds did not enhance the predictive value of the models. The developed systems biology-based pipeline is a valuable tool for predicting toxicity, while providing novel insights into the possible mechanisms of the unwanted events.
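
The way compound-target, compound-interactor, and compound-pathway profiles can be combined into one descriptor set may be easier to see in a small sketch. The one below is an illustration only, not the authors' implementation: all identifiers (`cmpd_A`, `T1`, `I1`, `P1`, and the function names) are hypothetical placeholders, the mappings are made up, and the assumption that interactor and pathway profiles are derived by expanding each compound's targets is ours.

```python
# Minimal sketch: build a combined binary descriptor matrix from
# target-, interactome-, and pathway-level annotations.
# All mappings below are invented placeholders, not data from the paper.

compound_targets = {
    "cmpd_A": {"T1", "T2"},
    "cmpd_B": {"T3"},
    "cmpd_C": set(),  # a compound with no known targets
}
target_interactors = {"T1": {"I1"}, "T2": {"I1"}, "T3": {"I2"}}
target_pathways = {"T1": {"P1"}, "T2": {"P1"}, "T3": {"P2"}}


def expand(targets, mapping):
    """Return the union of all entries that the given targets map to."""
    linked = set()
    for target in targets:
        linked |= mapping.get(target, set())
    return linked


def build_profiles(compound_targets, target_interactors, target_pathways):
    """Collect prefixed target, interactor, and pathway features per compound."""
    profiles = {}
    for compound, targets in compound_targets.items():
        profiles[compound] = (
            {f"target:{t}" for t in targets}
            | {f"interactor:{i}" for i in expand(targets, target_interactors)}
            | {f"pathway:{p}" for p in expand(targets, target_pathways)}
        )
    return profiles


def to_matrix(profiles):
    """Turn feature sets into 0/1 rows over a sorted, shared column list."""
    columns = sorted(set().union(*profiles.values()))
    rows = {
        compound: [1 if column in features else 0 for column in columns]
        for compound, features in profiles.items()
    }
    return columns, rows


profiles = build_profiles(compound_targets, target_interactors, target_pathways)
columns, matrix = to_matrix(profiles)

for compound, row in matrix.items():
    print(compound, row)
```

Prefixing each feature with its level keeps the three descriptor types separable, so a model can be trained on any single level or on any combination by filtering columns. A matrix of this shape could then be passed, together with toxicity labels, to a tree-based classifier such as a random forest, whose per-feature importances would point back to individual targets, interactors, or pathways.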

Predicting Organ Toxicity Using in Vitro Bioactivity Data and Chemical Structure.

Predictive Models for Human Organ Toxicity Based on in Vitro Bioactivity Data and Chemical Structure

Predicting hepatotoxicity using ToxCast in vitro bioactivity and chemical structure.

Toxicity prediction using target, interactome, and pathway profiles as descriptors

In Silico Prediction of Chemical Acute Oral Toxicity Using Multi-Classification Methods

Identifying Protein Features and Pathways Responsible for Toxicity Using Machine Learning and Tox21: Implications for Predictive Toxicology

Machine Learning for Predicting Organ Toxicity

In Silico Prediction Of Chemical Toxicity For Drug Design Using Machine Learning Methods And Structural Alerts

In Silico Prediction of Chemical Reproductive Toxicity Using Machine Learning

Hybrid non-animal modeling: A mechanistic approach to predict chemical hepatotoxicity

Predicting Chemical Toxicity Effects Based on Chemical-Chemical Interactions.

In Silico Prediction of Chemical Genotoxicity Using Machine Learning Methods and Structural Alerts.

Predictive Systems Toxicology

In Silico Prediction of Tetrahymena Pyriformis Toxicity for Diverse Industrial Chemicals with Substructure Pattern Recognition and Machine Learning Methods

A Deep Learning Based Multi-Model Approach for Predicting Drug-Like Chemical Compound’s Toxicity

An Explainable Supervised Machine Learning Model for Predicting Respiratory Toxicity of Chemicals Using Optimal Molecular Descriptors

Identification of Optimal Machine Learning Algorithms and Molecular Fingerprints for Explainable Toxicity Prediction Models Using ToxCast/Tox21 Bioassay Data

ADMET Evaluation in Drug Discovery. 18. Reliable Prediction of Chemical-Induced Urinary Tract Toxicity by Boosting Machine Learning Approaches

In Silico Prediction of Chemical Neurotoxicity Using Machine Learning.