Abstract:Studying drug-target interactions (DTIs) is the foundational and crucial phase in drug discovery. Biochemical experiments, while being the most reliable method for determining drug-target affinity (DTA), are time-consuming and costly, making it challenging to meet the current demands for swift and efficient drug development. Consequently, computational DTA prediction methods have emerged as indispensable tools for this research. In this article, we propose a novel deep learning algorithm named GRA-DTA, for DTA prediction. Specifically, we introduce Bidirectional Gated Recurrent Unit (BiGRU) combined with a soft attention mechanism to learn target representations. We employ Graph Sample and Aggregate (GraphSAGE) to learn drug representation, especially to distinguish the different features of drug and target representations and their dimensional contributions. We merge drug and target representations by an attention neural network (ANN) to learn drug-target pair representations, which are fed into fully connected layers to yield predictive DTA. The experimental results showed that GRA-DTA achieved mean squared error of 0.142 and 0.225 and concordance index reached 0.897 and 0.890 on the benchmark datasets KIBA and Davis, respectively, surpassing the most state-of-the-art DTA prediction algorithms.
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve
This paper aims to address the problem of Drug-Target Affinity (DTA) prediction. Specifically, the researchers propose a new deep learning algorithm—GRA-DTA (Graph Sample and Aggregate, Bidirectional Gated Recurrent Unit, and Attention Neural Network) for efficiently and accurately predicting the affinity between drugs and targets.
### Background and Motivation
1. **Importance of Drug-Target Interaction (DTI)**:
- Studying drug-target interactions is the foundation and key stage of drug discovery, helping to reveal the mechanisms of drug action and providing valuable insights for drug design and development.
- Traditional biochemical experimental methods, although reliable, are time-consuming and costly, making it difficult to meet the current demand for rapid and efficient drug development.
2. **Necessity of Computational Methods**:
- Therefore, computational methods have become indispensable in DTI research. These methods can accurately and reliably predict DTI, thereby accelerating the drug discovery process.
3. **Limitations of Existing Methods**:
- Traditional computational methods include molecular dynamics simulations and molecular docking. Although these methods have strong biological interpretability, they require a large amount of computational resources, have slow computation speeds, and are limited by the availability of protein 3D crystal structures.
- Early machine learning methods mainly treated DTI prediction as a binary classification task, only distinguishing between binding and non-binding categories, resulting in information loss and difficulties in threshold determination.
4. **Advantages of Regression Tasks**:
- In recent years, researchers have started modeling DTI prediction as a regression task, using Drug-Target Affinity (DTA) to accurately reflect the strength of DTI. DTA data can describe the strength of the binding interaction between drugs and targets, usually referring to dissociation constant (Kd), inhibition constant (Ki), and half-maximal inhibitory concentration (IC50).
### Proposed Method
1. **GRA-DTA Algorithm**:
- This algorithm combines Bidirectional Gated Recurrent Units (BiGRU) and a soft attention mechanism to learn target representations.
- It uses Graph Sample and Aggregate (GraphSAGE) to learn drug representations, with a particular emphasis on distinguishing the different features and dimensional contributions of drug and target representations.
- The drug and target representations are merged through an Attention Neural Network (ANN) to learn drug-target pair representations, which are then input into a fully connected layer for DTA prediction.
2. **Experimental Results**:
- On the benchmark datasets KIBA and Davis, GRA-DTA achieved Mean Squared Error (MSE) of 0.142 and 0.225, and Concordance Index (CI) of 0.897 and 0.890, respectively, surpassing the existing state-of-the-art DTA prediction algorithms.
### Conclusion
By proposing the GRA-DTA algorithm, this paper successfully addresses key issues in DTA prediction, demonstrating its potential application value in drug discovery. Experimental results show that the algorithm performs excellently on multiple benchmark datasets, particularly exhibiting strong generalization ability in cold-start scenarios.