Abstract: Understanding complex interactions in biomedical networks is crucial for advances in biomedicine. Traditional link prediction (LP) methods, which rely on similarity metrics such as Personalized PageRank, are limited in their ability to capture the complexity of biological networks. More recently, representation learning techniques have emerged that map nodes to low-dimensional embeddings to improve prediction accuracy. However, these methods often struggle with interpretability and scalability in large, complex networks. Building on a representation of biological systems as knowledge graphs (KGs), which encode entities and their relationships as triplets, we propose BioKGC, a novel graph neural network framework that extends the Neural Bellman-Ford Network (NBFNet). BioKGC addresses the limitations of previous methods by using path-based reasoning for LP in biomedical KGs. Unlike node-embedding frameworks, which optimize the embedding space based on single triplets, BioKGC learns representations between pairs of nodes by considering all relations along the paths that connect them. This approach improves both prediction accuracy and interpretability: influential paths can be visualized, which facilitates validation of biological plausibility. BioKGC leverages a background regulatory graph (BRG) for enhanced message passing and implements a stringent negative sampling strategy to improve learning precision. In evaluations across four LP tasks (gene function annotation, drug-disease interaction prediction, synthetic lethality prediction, and lncRNA-mRNA regulatory relationship inference), BioKGC consistently outperformed state-of-the-art methods. In gene function prediction, it outperformed knowledge graph embedding and GNN-based methods, especially when BRG information was included. In drug-disease interaction prediction, BioKGC was effective in zero-shot learning scenarios, surpassing state-of-the-art models such as TxGNN. BioKGC also demonstrated robust performance in synthetic lethality prediction and the capacity to score novel lncRNA-mRNA interactions, showcasing its versatility across diverse biomedical applications. One of BioKGC's key advantages is its interpretability, which enables researchers to trace prediction paths and gain insight into molecular mechanisms. Combined with its use of regulatory information for message passing, this makes BioKGC a powerful tool for predicting complex biological interactions and a valuable one for drug discovery and personalized medicine.
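To make the idea of path-based reasoning more concrete, the following is a minimal sketch of a Bellman-Ford-style propagation over KG triplets. It is not the BioKGC or NBFNet implementation: those models learn vector-valued messages and aggregation functions, whereas this toy version assumes a fixed scalar weight per relation type and a (max, product) scheme, so that the score of a target node is the weight of its best path from the source. All entity names, relation names, and weights below are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

Triplet = Tuple[str, str, str]  # (head, relation, tail)


def best_path_scores(
    triplets: List[Triplet],
    relation_weight: Dict[str, float],
    source: str,
    num_iters: int,
) -> Tuple[Dict[str, float], Dict[str, List[Triplet]]]:
    """Propagate scores from `source` along paths of at most `num_iters` edges.

    The score of a node is the maximum, over all such paths, of the product
    of relation weights along the path. The best path is stored as well,
    which is what makes the prediction traceable.
    """
    score: Dict[str, float] = {source: 1.0}   # boundary condition at the source
    path: Dict[str, List[Triplet]] = {source: []}
    for _ in range(num_iters):
        # Synchronous update: read from the previous iteration, write to a copy.
        new_score = dict(score)
        new_path = dict(path)
        for head, rel, tail in triplets:
            if head not in score:
                continue  # head not yet reached from the source
            candidate = score[head] * relation_weight[rel]
            if candidate > new_score.get(tail, 0.0):
                new_score[tail] = candidate
                new_path[tail] = path[head] + [(head, rel, tail)]
        score, path = new_score, new_path
    return score, path


# Hypothetical toy graph: two routes from a drug to a disease.
toy_triplets: List[Triplet] = [
    ("drug_A", "targets", "gene_X"),
    ("gene_X", "associated_with", "disease_D"),
    ("drug_A", "resembles", "drug_B"),
    ("drug_B", "treats", "disease_D"),
]
# Hand-picked weights in (0, 1]; a trained model would learn these instead.
toy_weights = {"targets": 0.9, "associated_with": 0.8, "resembles": 0.5, "treats": 0.9}

scores, paths = best_path_scores(toy_triplets, toy_weights, "drug_A", num_iters=2)
print(scores["disease_D"])
print(paths["disease_D"])
```

In this toy setting the route through `gene_X` outweighs the route through `drug_B`, and the stored path shows which chain of relations produced the score. That is the sense in which path-based predictions can be inspected, under the simplifying assumptions stated above.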
