A Related Data Oriented Joint Entity Resolution Approach

Chen-Chen SUN,De-Rong SHEN,Yue KOU,Tie-Zheng NIE,Ge YU
DOI: https://doi.org/10.11897/SP.J.1016.2015.01739
2015-01-01
Chinese Journal of Computers
Abstract:We propose a graph-based iterative joint entity resolution approach.To start off,an entity data object relationship graph is built from the input dataset consisting of multiple classes of related data objects.It hires a hybrid similarity,combining a structure similarity based on semantic paths and an attribute-based similarity,to decide whether two data objects match.Then it merges the matched pair and contracts the neighborhood of the merged pair,which leads to enrichment of semantics of the neighborhood.Enrichment of semantics may help generate some new candidate data object pairs in the neighborhood,which will be resolved later.Generation of new candidate data object pairs is called similarity propagation,making it an iterative process. With the iterative process going on,semantics of the object graph becomes richer and richer, promoting accuracy of entity resolution.The experimental evaluation proves that the proposed approach outperforms existing joint entity resolution approaches and relationship-based single class entity resolution approaches in accuracy.
What problem does this paper attempt to address?