Improving Semantic Similarity Computation Via Subgraph Feature Fusion Based on Semantic Awareness

Yuanfei Deng,Wen Bai,Jiawei Li,Shun Mao,Yuncheng Jiang
DOI: https://doi.org/10.1016/j.engappai.2024.108947
IF: 8
2024-01-01
Engineering Applications of Artificial Intelligence
Abstract:Semantic similarity is a critical aspect of natural language processing, as it evaluates the degree of similarity within a knowledge graph. Various computational methods, including distance-based and feature-based approaches, have been proposed to accurately measure this similarity. While existing methods can leverage diverse features within heterogeneous knowledge graphs, representing the overall structure, which encompasses a wide array of heterogeneous elements such as abstract descriptions and hidden relationships, remains challenging. To address the aforementioned challenges, our approach begins by employing the same text embedding method to map both abstract and category features into a unified vector space. We then extract features from DBpedia to construct concept and category graphs. Subsequently, we introduce a k-truss method based on semantic awareness within the DBpedia Concept Graph. This method identifies the significance of neighbouring concept nodes and assigns varying weights to enhance the representation of abstract features. Additionally, we propose a k-core method based on semantic awareness within the DBpedia Category Graph. This method identifies the importance of neighbouring category nodes and assigns different weights to enhance the representation of category features. Finally, we employ a hybrid weighting approach based on a feature fusion model to calculate semantic similarity. Experimental results demonstrate that our methods achieve a 5.33% improvement compared to existing approaches.
What problem does this paper attempt to address?