On Efficiently Finding Reverse K-Nearest Neighbors over Uncertain Graphs
Yunjun Gao,Xiaoye Miao,Gang Chen,Baihua Zheng,Deng Cai,Huiyong Cui
DOI: https://doi.org/10.1007/s00778-017-0460-y
2017-01-01
The VLDB Journal
Abstract:Reverse k-nearest neighbor (\(\hbox {R}k\hbox {NN}\)) query on graphs returns the data objects that take a specified query object q as one of their k-nearest neighbors. It has significant influence in many real-life applications including resource allocation and profile-based marketing. However, to the best of our knowledge, there is little previous work on \(\hbox {R}k\hbox {NN}\) search over uncertain graph data, even though many complex networks such as traffic networks and protein–protein interaction networks are often modeled as uncertain graphs. In this paper, we systematically study the problem of reverse k-nearest neighbor search on uncertain graphs (\(\hbox {UG-R}k\hbox {NN}\) search for short), where graph edges contain uncertainty. First, to address \(\hbox {UG-R}k\hbox {NN}\) search, we propose three effective heuristics, i.e., GSP, EGR, and PBP, which minimize the original large uncertain graph as a much smaller essential uncertain graph, cut down the number of possible graphs via the newly introduced graph conditional dominance relationship, and reduce the validation cost of data nodes in order to improve query efficiency. Then, we present an efficient algorithm, termed as SDP, to support \(\hbox {UG-R}k\hbox {NN}\) retrieval by seamlessly integrating the three heuristics together. In view of the high complexity of \(\hbox {UG-R}k\hbox {NN}\) search, we further present a novel algorithm called TripS, with the help of an adaptive stratified sampling technique. Extensive experiments using both real and synthetic graphs demonstrate the performance of our proposed algorithms.