Collective Entity Disambiguation Based on Hierarchical Semantic Similarity.

Bingjing Jia,Hu Yang,Bin Wu,Ying Xing
DOI: https://doi.org/10.4018/ijdwm.2020040101
2020-01-01
International Journal of Data Warehousing and Mining
Abstract:Entity disambiguation involves mapping mentions in texts to the corresponding entities in a given knowledge base. Most previous approaches were based on handcrafted features and failed to capture semantic information over multiple granularities. For accurately disambiguating entities, various information aspects of mentions and entities should be used in. This article proposes a hierarchical semantic similarity model to find important clues related to mentions and entities based on multiple sources of information, such as contexts of the mentions, entity descriptions and categories. This model can effectively measure the semantic matching between mentions and target entities. Global features are also added, including prior popularity and global coherence, to improve the performance. In order to verify the effect of hierarchical semantic similarity model combined with global features, named HSSMGF, experiments were carried out on five publicly available benchmark datasets. Results demonstrate the proposed method is very effective in the case that documents have more mentions.
What problem does this paper attempt to address?