SubGE-DDI: A new prediction model for drug-drug interaction established through biomedical texts and drug-pairs knowledge subgraph enhancement
Yiyang Shi,Mingxiu He,Junheng Chen,Fangfang Han,Yongming Cai
DOI: https://doi.org/10.1371/journal.pcbi.1011989
2024-04-17
PLoS Computational Biology
Abstract:Biomedical texts provide important data for investigating drug-drug interactions (DDIs) in the field of pharmacovigilance. Although researchers have attempted to investigate DDIs from biomedical texts and predict unknown DDIs, the lack of accurate manual annotations significantly hinders the performance of machine learning algorithms. In this study, a new DDI prediction framework, Subgraph Enhance model, was developed for DDI (SubGE-DDI) to improve the performance of machine learning algorithms. This model uses drug pairs knowledge subgraph information to achieve large-scale plain text prediction without many annotations. This model treats DDI prediction as a multi-class classification problem and predicts the specific DDI type for each drug pair (e.g. Mechanism, Effect, Advise, Interact and Negative). The drug pairs knowledge subgraph was derived from a huge drug knowledge graph containing various public datasets, such as DrugBank, TwoSIDES, OffSIDES, DrugCentral, EntrezeGene, SMPDB (The Small Molecule Pathway Database), CTD (The Comparative Toxicogenomics Database) and SIDER. The SubGE-DDI was evaluated from the public dataset (SemEval-2013 Task 9 dataset) and then compared with other state-of-the-art baselines. SubGE-DDI achieves 83.91% micro F1 score and 84.75% macro F1 score in the test dataset, outperforming the other state-of-the-art baselines. These findings show that the proposed drug pairs knowledge subgraph-assisted model can effectively improve the prediction performance of DDIs from biomedical texts. Drug-drug interactions occur when two or more drugs react with each other, potentially leading to adverse drug reactions that can manifest as toxicity, reduced treatment efficacy, and, in extreme cases, patient mortality. With the growing number of approved drugs, detecting negative drug-drug interactions has become a crucial concern in pharmacovigilance. However, manually mining these interactions from the vast and rapidly expanding pool of published biomedical texts, such as clinical trial reports and literature, is arduous and challenging. To address this issue, various machine learning methods have been developed and applied to text mining for both known and unknown drug-drug interactions. Most of these methods primarily predict interactions by considering only drug molecular properties and text characters, neglecting valuable information about interactions between different entities, such as drugs, pathways, and side effects. In response, our approach seeks to enhance the performance of drug-drug interaction prediction by leveraging knowledge graphs constructed from a variety of biomedical entities. Our experimental results validate this approach, demonstrating that incorporating interactive information from knowledge graphs leads to the development of a more efficient model for predicting drug-drug interactions.
biochemical research methods,mathematical & computational biology