Abstract: Drug-target binding affinity prediction plays an important role in the early stages of drug discovery because it can infer the strength of interactions between new drugs and new targets. However, the performance of previous computational models is limited by two drawbacks. First, drug representations are learned only from supervised data, without taking into account the information contained in the molecular graph itself. Second, most previous studies design complicated representation learning modules while ignoring uniformity, a measure of representation quality. In this study, we propose GraphCL-DTA, a graph contrastive learning method with molecular semantics for drug-target binding affinity prediction. In GraphCL-DTA, we design a graph contrastive learning framework for molecular graphs to learn drug representations in a way that preserves the semantics of the molecular graphs. Through this framework, a more essential and effective drug representation can be learned without additional supervised data. Next, we design a new loss function that can be used directly to smoothly adjust the uniformity of drug and target representations. Directly optimizing uniformity improves the representation quality of both drugs and targets. The effectiveness of these elements is verified on two real-world datasets, KIBA and Davis. The performance of GraphCL-DTA on these datasets suggests that it outperforms state-of-the-art models.

What problem does this paper attempt to address?

This paper addresses how to predict the binding affinity between drugs and targets more accurately in the early stages of drug discovery. According to the authors, existing computational models are limited in three ways:

1. **Drug representation learning depends on supervised data**: Most existing methods learn drug representations only from known drug-target binding affinities and do not make full use of the information in the molecular graphs themselves. This creates a dependence on large amounts of supervised data, which increases time and cost.
2. **Complex representation learning module design**: Many studies design complex representation learning modules but overlook uniformity, an important indicator of representation quality.
3. **Loss functions do not consider the uniformity of representations**: Existing models usually optimize their parameters with the mean squared error (MSE) loss, which does not account for the uniformity of drug and target representations and therefore limits model performance.

To overcome these limitations, the authors propose the GraphCL-DTA model, which improves drug-target binding affinity prediction through the following contributions:

1. **Graph contrastive learning framework**: A graph contrastive learning framework learns drug representations from the semantic information of molecular graphs without additional supervised data. Contrastive views are generated by adding controllable random noise in the drug representation space, rather than by altering the graph, so the semantic information of the molecular graph is preserved and more essential and effective drug representations can be learned.
2. **Loss function for optimizing representation uniformity**: A new loss function directly adjusts the uniformity of drug and target representations. It extends the MSE loss with a regularization term that optimizes uniformity directly, which improves representation quality and the predictive ability of the model.
3. **Experimental verification**: Extensive experiments on two publicly available datasets (KIBA and Davis) show that GraphCL-DTA outperforms existing state-of-the-art models on these datasets.

In summary, the main contribution of this paper is a graph contrastive learning framework that learns more effective drug representations without additional supervised data, combined with a loss function that further improves model performance by optimizing the uniformity of representations.
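The noise-based contrastive views and the uniformity-regularized loss described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the summary gives no formulas, so the sketch assumes an InfoNCE contrastive loss, a fixed-norm random perturbation of size `eps`, and the pairwise Gaussian-potential definition of uniformity that is common in the contrastive learning literature. All function names, the temperature `tau`, the scale `t`, and the weights `lam_cl` and `lam_u` are hypothetical.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    # Scale each row to (approximately) unit L2 norm.
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def noisy_view(emb, eps, rng):
    # Assumed noise scheme: add a random direction of L2 norm `eps` to each
    # embedding, so the view is built in representation space and the
    # molecular graph itself is never modified.
    noise = rng.standard_normal(emb.shape)
    return emb + eps * l2_normalize(noise)

def info_nce(view_a, view_b, tau=0.2):
    # Assumed contrastive objective: row i of view_a should match row i of
    # view_b and differ from every other row.
    a, b = l2_normalize(view_a), l2_normalize(view_b)
    logits = a @ b.T / tau
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_prob))

def uniformity(emb, t=2.0):
    # Assumed uniformity measure: log of the mean Gaussian potential over all
    # distinct pairs of normalized embeddings. Lower means more uniform.
    z = l2_normalize(emb)
    sq_dist = np.sum((z[:, None, :] - z[None, :, :]) ** 2, axis=-1)
    upper = np.triu_indices(len(z), k=1)
    return np.log(np.mean(np.exp(-t * sq_dist[upper])))

def total_loss(pred, target, drug_emb, target_emb, rng,
               eps=0.1, lam_cl=0.1, lam_u=0.1):
    # MSE on affinities, plus a contrastive term on two noisy drug views,
    # plus a uniformity regularizer on drug and target embeddings.
    mse = np.mean((pred - target) ** 2)
    cl = info_nce(noisy_view(drug_emb, eps, rng), noisy_view(drug_emb, eps, rng))
    uni = uniformity(drug_emb) + uniformity(target_emb)
    return mse + lam_cl * cl + lam_u * uni

rng = np.random.default_rng(0)
drug_emb = rng.standard_normal((8, 16))    # stand-in for drug encoder output
target_emb = rng.standard_normal((8, 16))  # stand-in for target encoder output
pred, target = rng.standard_normal(8), rng.standard_normal(8)
loss = total_loss(pred, target, drug_emb, target_emb, rng)
```

In this sketch, `eps` plays the role of the "controllable" noise magnitude: a small value keeps both views close to the original embedding. The weights `lam_cl` and `lam_u` control how strongly the contrastive and uniformity terms influence training relative to the MSE term.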

GraphCL-DTA: a graph contrastive learning with molecular semantics for drug-target binding affinity prediction
