Abstract
Motivation In drug discovery, it is crucial to assess the drug–target binding affinity (DTA). Although molecular docking is widely used, its computational cost limits its application in large-scale virtual screening. Deep learning-based methods learn virtual scoring functions from labeled datasets and can predict affinity quickly. However, existing methods have three limitations. First, they consider only the atom–bond graph or one-dimensional sequence representations of compounds, ignoring information about functional groups (pharmacophores) with specific biological activities. Second, because they rely on limited labeled datasets, they fail to learn comprehensive embedding representations of compounds and proteins, which results in poor generalization performance in complex scenarios. Third, existing feature fusion methods cannot adequately capture contextual interaction information.
Results We therefore propose a novel DTA prediction method named HeteroDTA. Specifically, a multi-view compound feature extraction module is constructed to model the atom–bond graph and the pharmacophore graph. The residue contact graph and the protein sequence are also used to model protein structure and function. Moreover, to enhance generalization capability and reduce the dependence on task-specific labeled data, pre-trained models are used to initialize the atomic features of the compounds and the embedding representations of the protein sequence. A context-aware nonlinear feature fusion method is also proposed to learn interaction patterns between compounds and proteins. Experimental results on public benchmark datasets show that HeteroDTA significantly outperforms existing methods. In addition, HeteroDTA shows excellent generalization performance in cold-start experiments and superior representation learning ability for drug–target pairs. Finally, the effectiveness of HeteroDTA is demonstrated in a real-world drug discovery study.
Availability and implementation The source code and data are available at https://github.com/daydayupzzl/HeteroDTA.
