BioKG-CMI: a Multi-Source Feature Fusion Model Based on Biological Knowledge Graph for Predicting Circrna-Mirna Interactions

Mengmeng Wei,Lei Wang,Yang Li,Zhengwei Li,Bowei Zhao,Xiaorui Su,Yu Wei,Zhuhong You
DOI: https://doi.org/10.1007/s11432-024-4098-3
2024-01-01
Science China Information Sciences
Abstract:This study proposes a model named BioKG-CMI to predict CMIs based on a biological knowledge graph. Faced with limited data, we employ subcellular localization to generate negative samples that align more closely with biological logic. To mine semantic information in circRNA and miRNA sequences, we introduce the pre-trained model BERT to learn sequence feature representation. Guided by the hypothesis that adjacent molecules have similar functions, we calculate spatial proximity between nodes of the same class. The DisMult algorithm is applied to extract the potential logical rules of the knowledge graph and learn entity and relationship representations. Subsequently, the integration of multi-feature successfully addresses the challenge of expressing the complex biological knowledge graph and overcoming the limitation of single-feature inadequacy. Multiple comparative experiments and case studies demonstrate the robustness of the proposed model.
What problem does this paper attempt to address?