Abstract:Background and objectives: Assessment of drugs toxicity and associated biomarker genes is one of the most important tasks in the pre-clinical phase of drug development pipeline as well as in toxicogenomic studies. There are few statistical methods for the assessment of doses of drugs (DDs) toxicity and their associated biomarker genes. However, these methods consume more time for computation of the model parameters using the EM (expectation-maximization) based iterative approaches. To overcome this problem, in this paper, an attempt is made to propose an alternative approach based on hierarchical clustering (HC) for the same purpose. Methods and materials: There are several types of HC approaches whose performance depends on different similarity/distance measures. Therefore, we explored suitable combinations of distance measures and HC methods based on Japanese Toxicogenomics Project (TGP) datasets for better clustering/co-clustering between DDs and genes as well as to detect toxic DDs and their associated biomarker genes. Results: We observed that Word’s HC method with each of Euclidean, Manhattan, and Minkowski distance measures produces better clustering/co-clustering results. For an example, in the case of the glutathione metabolism pathway (GMP) dataset LOC100359539/Rrm2, Gpx6, RGD1562107, Gstm4, Gstm3, G6pd, Gsta5, Gclc, Mgst2, Gsr, Gpx2, Gclm, Gstp1, LOC100912604/Srm, Gstm4, Odc1, Gsr, Gss are the biomarker genes and Acetaminophen_Middle, Acetaminophen_High, Methapyrilene_High, Nitrofurazone_High, Nitrofurazone_Middle, Isoniazid_Middle, Isoniazid_High are their regulatory (associated) DDs explored by our proposed co-clustering algorithm based on the distance and HC method combination Euclidean: Word. Similarly, for the peroxisome proliferator-activated receptor signaling pathway (PPAR-SP) dataset Cpt1a, Cyp8b1, Cyp4a3, Ehhadh, Plin5, Plin2, Fabp3, Me1, Fabp5, LOC100910385, Cpt2, Acaa1a, Cyp4a1, LOC100365047, Cpt1a, LOC100365047, Angptl4, Aqp7, Cpt1c, Cpt1b, Me1 are the biomarker genes and Aspirin_Low, Aspirin_Middle, Aspirin_High, Benzbromarone_Middle, Benzbromarone_High, Clofibrate_Middle, Clofibrate_High, WY14643_Low, WY14643_High, WY14643_Middle, Gemfibrozil_Middle, Gemfibrozil_High are their regulatory DDs. Conclusions: Overall, the methods proposed in this article, co-cluster the genes and DDs as well as detect biomarker genes and their regulatory DDs simultaneously consuming less time compared to other mentioned methods. The results produced by the proposed methods have been validated by the available literature and functional annotation.

Classification of Chemicals Based on Concentration-Dependent Toxicological Data Using ToxClust

Classification of chemical compounds based on the correlation between \textit{in vitro} gene expression profiles

Mold(2), molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics.

Pathway-based Assessment of Single Chemicals and Mixtures by a High-Throughput Transcriptomics Approach

Grouping 34 Chemicals Based on Mode of Action Using Connectivity Mapping

Predictive toxicology of chemical mixtures using proteome-wide thermal profiling and protein target properties

Signature Analysis of High-Throughput Transcriptomics Screening Data for Mechanistic Inference and Chemical Grouping

Identification of nonmonotonic concentration-responses in Tox21 high-throughput screening estrogen receptor assays

The TTC Data Mart: An interactive browser for threshold of toxicological concern calculations

Assessment of Drugs Toxicity and Associated Biomarker Genes Using Hierarchical Clustering

Application and Prospect of Computational Toxicology in Screening of Endocrine Disrupting Chemicals

A Ternary Classification Using Machine Learning Methods of Distinct Estrogen Receptor Activities Within A Large Collection of Environmental Chemicals

Integration of Leave-One-out Method and Real-Time Live Cell Reporter Array System to Assess the Toxicity of Mixtures.

Cheminformatics Analysis of the Multitarget Structure-Activity Landscape of Environmental Chemicals Binding to Human Endocrine Receptors

In Silico Prediction of Chemical Acute Oral Toxicity Using Multi-Classification Methods

Qualitative and Quantitative Concentration-Response Modelling of Gene Co-expression Networks to Unlock Hepatotoxic Mechanisms for Next Generation Chemical Safety Assessment

Systematic Approaches for the Encoding of Chemical Groups: A Case Study

Computational Insights into Reproductive Toxicity: Clustering, Mechanism Analysis, and Predictive Models

Screening and characterization of 133 physiologically-relevant environmental chemicals for reproductive toxicity

Predictive Models for Human Organ Toxicity Based on in Vitro Bioactivity Data and Chemical Structure

Deep Learning Bridged Bioactivity, Structure, and GC-HRMS-Readable Evidence to Decipher Nontarget Toxicants in Sediments