BiTGNN: Prediction of drug–target interactions based on bidirectional transformer and graph neural network on heterogeneous graph
Qingqian Zhang,Changxiang He,Xiaofei Qin,Peisheng Yang,Junyang Kong,Yaping Mao,Die Li
DOI: https://doi.org/10.1142/s1793524524500256
2024-05-25
International Journal of Biomathematics
Abstract:International Journal of Biomathematics, Ahead of Print. Drug–target interaction (DTI) is a widely explored topic in the field of bioinformatics and plays a pivotal role in drug discovery. However, the traditional bio-experimental process of drug–target interaction identification requires a large investment of time and labor. To address this challenge, graph neural network (GNN) approaches in deep learning are becoming a prominent trend in the field of DTI research, which is characterized by multimodal processing of data, feature learning and interpretability in DTI. Nevertheless, some methods are still limited by homogeneous graphs and single features. To address the problems, we mechanistically analyze graph convolutional neural networks (GCNs) and graph attentional neural networks (GATs) to propose a new model for the prediction of drug–target interactions using graph neural networks named BiTGNN [Bidirectional Transformer (Bi-Transformer)–graph neural network]. The method first establishes drug–target pairs through the pseudo-position specificity scoring matrix (PsePSSM) and drug fingerprint data, and constructs a heterogeneous network by utilizing the relationship between the drug and the target. Then, the computational extraction of drug and target attributes is performed using GCNs and GATs for the purpose of model information flow extension and graph information enhancement. We collect interaction data using the proposed Bi-Transformer architecture, in which we design a bidirectional cross-attention mechanism for calculating the effects of drug–target interactions for realistic biological interaction simulations. Finally, a feed-forward neural network is used to obtain the feature matrices of the drug and the target, and DTI prediction is performed by fusing the two feature matrices. The Enzyme, Ion Channel (IC), G Protein-coupled Receptor (GPCR) and Nuclear Receptor (NR) datasets are used in the experiments, and compared with several existing mainstream models, our model outperforms in Area Under the ROC Curve (AUC), Specificity, Accuracy and the metric Area Under the Precision–Recall Curve (AUPR).
mathematical & computational biology