Graph Similarity Join with K-Hop Tree Indexing

yue wang,hongzhi wang,chen ye,hong gao
DOI: https://doi.org/10.1007/978-3-662-46248-5_6
2015-01-01
Abstract:Graph similarity join has become imperative for integrating noisy and inconsistent data from multiple data sources. The edit distance is commonly used to measure the similarity between graphs. To accelerate the similarity join based on graph edit distance, in the paper, we make use of a preprocessing strategy to remove the mismatching graph pairs with significant differences. Then a novel method of building indexes for each graph is proposed by grouping the nodes which can be reached in k hops for each key node with structure conservation, which is the k-hop-tree based indexing method. Experiments on real and synthetic graph databases also confirm that our method can achieve good join quality in graph similarity join. Besides, the join process can be finished in polynomial time.
What problem does this paper attempt to address?