Benchmarking Graph Learning for Drug-Drug Interaction Prediction

Zhenqian Shen, Mingyang Zhou, Yongqi Zhang, Quanming Yao
2024-10-29
Abstract: Predicting drug-drug interaction (DDI) plays an important role in pharmacology and healthcare for identifying potential adverse interactions and beneficial combination therapies between drug pairs. Recently, a flurry of graph learning methods have been introduced to predict drug-drug interactions. However, evaluating existing methods has several limitations, such as the absence of a unified comparison framework for DDI prediction methods, lack of assessments in meaningful real-world scenarios, and insufficient exploration of side information usage. In order to address these unresolved limitations in the literature, we propose a DDI prediction benchmark on graph learning. We first conduct unified evaluation comparison among existing methods. To meet realistic scenarios, we further evaluate the performance of different methods in settings with new drugs involved and examine the performance across different DDI types. Component analysis is conducted on the biomedical network to better utilize side information. Through this work, we hope to provide more insights for the problem of DDI prediction. Our implementation and data is open-sourced at https://anonymous.4open.science/r/DDI-Benchmark-ACD9/.
Machine Learning
What problem does this paper attempt to address?
The main problems that this paper attempts to solve include the following aspects:

1. **Lack of a unified evaluation framework**:
   - Currently, the evaluation of drug-drug interaction (DDI) prediction methods lacks a unified comparison framework. This includes unified datasets, evaluation metrics, and hyper-parameter settings. This inconsistency makes it difficult to accurately assess the effectiveness of existing methods.
2. **Lack of evaluation in real-world scenarios**:
   - Most of the existing DDI prediction methods rarely consider new drugs or special types of DDI during evaluation. However, these situations are very important in practical applications. For example, when introducing a new drug, it is crucial to evaluate its potential interactions with other drugs; at the same time, different types of DDI also have different impacts on clinical safety and efficacy.
3. **Insufficient use of side information**:
   - The data sparsity problem is a common challenge in DDI prediction, which severely limits the performance of machine-learning methods. To solve this problem, researchers have introduced biomedical networks as side information, but it is still unclear which parts of the biomedical network information are most helpful for DDI prediction.

### Specific objectives of the paper

To address the above problems, the paper proposes the following specific objectives:

- **Establish a unified evaluation benchmark**: By designing a comprehensive evaluation framework, uniformly evaluate the existing DDI prediction methods to ensure that different methods can be fairly compared under the same conditions.
- **Evaluate performance in real-world scenarios**: Pay special attention to situations involving new drugs and different types of DDI to ensure the effectiveness of the model in practical applications.
- **Explore the utilization of biomedical networks**: Through component analysis of biomedical networks, find out which parts of the information are most helpful for DDI prediction, thereby improving the prediction performance.

### Formula representation

The formulas involved in the paper are represented in Markdown format as follows:

- Definition of DDI graph:
  \[ G_{\text{DDI}} = \{ V_{\text{DDI}}, E_{\text{DDI}}, R_{\text{DDI}} \} \]
  - \( V_{\text{DDI}} \) represents the set of drug nodes
  - \( E_{\text{DDI}} \) represents the set of edges of interactions between drugs
  - \( R_{\text{DDI}} \) represents the set of interaction types
- Definition of biomedical network:
  \[ G_{\text{BN}} = \{ V_{\text{BN}}, E_{\text{BN}}, R_{\text{BN}} \} \]
  - \( V_{\text{BN}} \) represents the set of biomedical entities
  - \( E_{\text{BN}} = \{ (u, r, v) \mid u, v \in V_{\text{BN}}, r \in R_{\text{BN}} \} \) represents the set of relationship edges between entities
  - \( R_{\text{BN}} \) represents the set of relationship types

Through these improvements, the paper hopes to provide a deeper understanding of DDI prediction and promote the development of future research.
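
The graph definitions above can be made concrete with a small sketch. The Python below is an illustration only, not the paper's implementation: the class `MultiRelGraph`, the helper `split_by_new_drugs`, and every drug, gene, and relation name are hypothetical. It stores both the DDI graph and the biomedical network as sets of (u, r, v) triples, and it assumes that a "new drug" evaluation can be approximated by bucketing DDI edges by how many of their endpoints are held-out drugs. The paper's exact protocol may differ.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import NamedTuple


class Triple(NamedTuple):
    """One edge (u, r, v): head entity, relation type, tail entity."""
    head: str
    relation: str
    tail: str


@dataclass
class MultiRelGraph:
    """A multi-relational graph G = {V, E, R} stored as a set of triples.

    Simplification (assumption): V and R are derived from the edges,
    so isolated nodes are not represented.
    """
    edges: set[Triple] = field(default_factory=set)

    def add(self, u: str, r: str, v: str) -> None:
        self.edges.add(Triple(u, r, v))

    @property
    def nodes(self) -> set[str]:
        return {e.head for e in self.edges} | {e.tail for e in self.edges}

    @property
    def relations(self) -> set[str]:
        return {e.relation for e in self.edges}


def split_by_new_drugs(ddi: MultiRelGraph, new_drugs: set[str]) -> dict[int, list[Triple]]:
    """Bucket DDI edges by the number of endpoints (0, 1, or 2) that are new drugs."""
    buckets: dict[int, list[Triple]] = {0: [], 1: [], 2: []}
    for e in ddi.edges:
        n_new = int(e.head in new_drugs) + int(e.tail in new_drugs)
        buckets[n_new].append(e)
    return buckets


# Hypothetical toy DDI graph (G_DDI): drugs linked by typed interactions.
ddi = MultiRelGraph()
ddi.add("drug_A", "increases_effect", "drug_B")
ddi.add("drug_B", "decreases_metabolism", "drug_C")
ddi.add("drug_C", "increases_effect", "drug_D")

# Hypothetical toy biomedical network (G_BN): side information about the drugs.
bn = MultiRelGraph()
bn.add("drug_A", "targets", "gene_X")
bn.add("gene_X", "associated_with", "disease_Y")

# Treat drug_C and drug_D as unseen during training.
buckets = split_by_new_drugs(ddi, new_drugs={"drug_C", "drug_D"})
```

In this toy setup, `buckets[0]` holds edges between known drugs only, `buckets[1]` holds edges with exactly one new drug, and `buckets[2]` holds edges between two new drugs. Keeping the biomedical network in the same triple format means side information for a new drug can be looked up by node name even when it has no DDI edges in the training set.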