A Top-k Query Calculation Algorithm in Distributed Networks with Probabilistic Guarantees

Wenting Yu,Yanming Shen,Keqiu Li,Junfeng Xu,Yong Li
DOI: https://doi.org/10.1109/WMWA.2009.82
2009-01-01
Abstract:Top-k queries based on ranking elements stop query processing when the top-k ranked results can be safely determined.There are two main methods for top-k query, accurate top-k query and approximate top-k query. However,existing top-k query consumes much bandwidth. Motivated by userpsilas goal to identify one or a few relevant data behind top-k query, it is attractive to use approximate top-k query algorithms to reduce the bandwidth usage. In this paper,we propose a three-phase approximate algorithm (TPAA),which is based on determining the value difference of the same object in different nodes. TPAA precuts the object whose values have big difference in different nodes. By precutting the illegitimate objects with a high probability,TPAA can reduce bandwidth consumption with high precision in some cases. It also supports probabilistic pruning of candidates, considerably reducing bandwidth usage at the expense of a small loss in precision of the top-k results.Furthermore, by performance evaluations using both theoretical analysis and computer simulations, we show that the proposed algorithm can reduce the bandwidth usage compared with existing probabilistic algorithms.
What problem does this paper attempt to address?