Abstract: Predictive modeling of toxicity is a crucial step in the drug discovery pipeline. It can help filter out molecules with a high probability of failing in the early stages of de novo drug design. Accordingly, several machine learning (ML) models have been developed to predict molecular toxicity by combining classical ML techniques or deep neural networks with well-known molecular representations such as fingerprints or 2D graphs. However, a more natural and accurate representation of molecules is expected to be defined in physical 3D space, as in ab initio methods. Recent studies have successfully used equivariant graph neural networks (EGNNs) for representation learning based on 3D structures to predict quantum-mechanical properties of molecules. Inspired by this, we investigated the performance of EGNNs in constructing reliable ML models for toxicity prediction, using the equivariant transformer (ET) model in TorchMD-NET. We evaluated the capability of ET for toxicity prediction on eleven toxicity data sets taken from MoleculeNet, TDCommons, and ToxBenchmark. Our results show that ET adequately learns 3D representations of molecules that correlate with toxicity activity, achieving accuracies comparable to state-of-the-art models on most data sets. We also tested a physicochemical property, namely the total energy of a molecule, as a physical prior to inform the toxicity prediction. However, our results suggest that these two properties cannot be related. We also provide an attention weight analysis to help interpret toxicity predictions in 3D space and thus increase the explainability of the ML model. In summary, our findings offer promising insights into the use of 3D geometry information via EGNNs and provide a straightforward way to integrate molecular conformers into ML-based pipelines for predicting and investigating toxicity in physical space. We expect that in the future, especially for larger, more diverse data sets, EGNNs will be an essential tool in this domain.
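
To make the role of 3D geometry concrete, the following minimal sketch checks one property that models operating on 3D structures generally rely on: interatomic distances do not change when a molecule is rotated, so a scalar prediction built on them (such as a toxicity label) should not depend on orientation. This is an illustration only, not the ET model or any TorchMD-NET code; the helper names `pairwise_distances` and `rotate_z` and the water-like coordinates are assumptions introduced here.

```python
import math


def pairwise_distances(coords):
    """Return all interatomic distances (upper triangle) for (x, y, z) points."""
    n = len(coords)
    return [
        math.dist(coords[i], coords[j])
        for i in range(n)
        for j in range(i + 1, n)
    ]


def rotate_z(coords, angle):
    """Rotate every point about the z-axis by `angle` radians."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y, s * x + c * y, z) for x, y, z in coords]


# Hypothetical water-like geometry in angstroms (illustrative values only,
# not taken from the paper or its data sets).
water = [
    (0.000, 0.000, 0.117),    # O
    (0.000, 0.757, -0.469),   # H
    (0.000, -0.757, -0.469),  # H
]

original = pairwise_distances(water)
rotated = pairwise_distances(rotate_z(water, math.pi / 3))

# Distances agree up to floating-point error, so they are rotation-invariant.
invariant = all(
    math.isclose(a, b, abs_tol=1e-9) for a, b in zip(original, rotated)
)
print(invariant)
```

The raw coordinates themselves do change under rotation, which is why a model that consumes conformers needs invariant or equivariant features rather than the coordinates alone.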

Reusability report: exploring the utility of variational graph encoders for predicting molecular toxicity in drug design

ComABAN: refining molecular representation with the graph attention mechanism to accelerate drug discovery

Disease prediction with edge-variational graph convolutional networks

An Adaptive Graph Learning Method for Automated Molecular Interactions and Properties Predictions

Application of variational graph encoders as an effective generalist algorithm in computer-aided drug design

Equivariant Graph Neural Networks for Toxicity Prediction

Identification of Vital Chemical Information Via Visualization of Graph Neural Networks

Exploring Low-Toxicity Chemical Space with Deep Learning for Molecular Generation

Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models

GraphADT: empowering interpretable predictions of acute dermal toxicity with multi-view graph pooling and structure remapping

AEGNN-M: A 3D Graph-Spatial Co-Representation Model for Molecular Property Prediction

A deep learning based multi-model approach for predicting drug-like chemical compound's toxicity

Integrating structure annotation and machine learning approaches to develop graphene toxicity models

Reusability report: Uncovering associations in biomedical bipartite networks via a bilinear attention network with domain adaptation

Predicting Drug-Drug Interactions using Deep Generative Models on Graphs

Graph2MDA: a multi-modal variational graph embedding model for predicting microbe-drug associations

Drug repositioning based on heterogeneous networks and variational graph autoencoders

Prediction of Adverse Drug Reactions by Combining Biomedical Tripartite Network and Graph Representation Model

Advanced graph and sequence neural networks for molecular property prediction and drug discovery

Conformalized Graph Learning for Molecular ADMET Property Prediction and Reliable Uncertainty Quantification