Accelerating Graph Similarity Search via Efficient GED Computation
Lijun Chang,Xing Feng,Kai Yao,Lu Qin,Wenjie Zhang
DOI: https://doi.org/10.1109/tkde.2022.3153523
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Computing the graph edit distance (GED) between graphs is the core operation in graph similarity search. Recent studies suggest that the existing index structures are ineffective in reducing the overall processing time of graph similarity search, and that directly verifying the GED between the query graph and every data graph in the database is still the best option. The state-of-the-art algorithm for GED verification is the recently proposed ${{{\mathsf {AStar}}} \text{-}{\mathtt {LSa}} }$AStar-LSa. However, ${{{\mathsf {AStar}}} \text{-}{\mathtt {LSa}} }$AStar-LSa may consume an extremely large amount of main memory or even run out-of-memory, when the graphs become larger and/or the GED threshold becomes larger. In this paper, we aim to improve the efficiency of GED verification and simultaneously lower the main memory consumption. To achieve that, we propose a new estimation for the lower bounds of partial mappings between graphs. We formally prove that our new lower bound is tighter than the one used in ${{{\mathsf {AStar}}} \text{-}{\mathtt {LSa}} }$AStar-LSa. Moreover, we also propose efficient algorithms to compute the lower bounds, as well as optimization techniques to improve the efficiency. Empirical studies on real datasets demonstrate that our newly proposed algorithm ${{{\mathsf {AStar}}} \text{-}{\mathtt {BMao}} }$AStar-BMao runs faster, and at the same time consumes much less main memory, than ${{{\mathsf {AStar}}} \text{-}{\mathtt {LSa}} }$AStar-LSa.
computer science, information systems, artificial intelligence,engineering, electrical & electronic