Abstract:Introduction: With the escalating menace of organic compounds in environmental pollution imperiling the survival of aquatic organisms, the investigation of organic compound toxicity across diverse aquatic species assumes paramount significance for environmental protection. Understanding how different species respond to these compounds helps assess the potential ecological impact of pollution on aquatic ecosystems as a whole. Compared with traditional experimental methods, deep learning methods have higher accuracy in predicting aquatic toxicity, faster data processing speed and better generalization ability. Objectives: This article presents ATFPGT-multi, an advanced multi-task deep neural network prediction model for organic toxicity. Methods: The model integrates molecular fingerprints and molecule graphs to characterize molecules, enabling the simultaneous prediction of acute toxicity for the same organic compound across four distinct fish species. Furthermore, to validate the advantages of multi-task learning, we independently construct prediction models, named ATFPGT-single, for each fish species. We employ cross-validation in our experiments to assess the performance and generalization ability of ATFPGT-multi. Results: The experimental results indicate, first, that ATFPGT-multi outperforms ATFPGT-single on four fish datasets with AUC improvements of 9.8%, 4%, 4.8%, and 8.2%, respectively, demonstrating the superiority of multi-task learning over single-task learning. Furthermore, in comparison with previous algorithms, ATFPGT-multi outperforms comparative methods, emphasizing that our approach exhibits higher accuracy and reliability in predicting aquatic toxicity. Moreover, ATFPGT-multi utilizes attention scores to identify molecular fragments associated with fish toxicity in organic molecules, as demonstrated by two organic molecule examples in the main text, demonstrating the interpretability of ATFPGT-multi. Conclusion: In summary, ATFPGT-multi provides important support and reference for the further development of aquatic toxicity assessment. All of codes and datasets are freely available online at https://github.com/zhaoqi106/ATFPGT-multi.

Myxoid Meningiomas of the Rostral Cervical Spinal Cord and Caudal Fossa in Four Dogs

Multitask CapsNet: an Imbalanced Data Deep Learning Method for Predicting Toxicants

Multimodal Representation Learning via Graph Isomorphism Network for Toxicity Multitask Learning

Transfer learning using attentions across atomic systems with graph neural networks (TAAG)

Mining Toxicity Information from Large Amounts of Toxicity Data

Multitask Deep Learning with Dynamic Task Balancing for Quantum Mechanical Properties Prediction

TransG-net: transformer and graph neural network based multi-modal data fusion network for molecular properties prediction

Multi-task aquatic toxicity prediction model based on multi-level features fusion

KG-MTL: Knowledge Graph Enhanced Multi-Task Learning for Molecular Interaction

Multi-layer graph attention neural networks for accurate drug-target interaction mapping

An Integrated Transfer Learning and Multitask Learning Approach for Pharmacokinetic Parameter Prediction

BiTGNN: Prediction of drug–target interactions based on bidirectional transformer and graph neural network on heterogeneous graph

Graph Neural Tree: A novel and interpretable deep learning-based framework for accurate molecular property predictions

Multitask Learning On Graph Neural Networks Applied To Molecular Property Predictions

Metapath-aggregated heterogeneous graph neural network for drug-target interaction prediction

GSAML-DTA: An interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information

Integrating Chemical Language and Molecular Graph in Multimodal Fused Deep Learning for Drug Property Prediction

Co‐Model for Chemical Toxicity Prediction Based on Multi‐task Deep Learning

Multidta: drug-target binding affinity prediction via representation learning and graph convolutional neural networks

Multi-task learning models for predicting active compounds

Predictive Multitask Deep Neural Network Models for ADME-Tox Properties: Learning from Large Data Sets