Efficient Top-K Query Processing in P2P Network

Yingjie He, Yanfeng Shu, Shan Wang, Xiaoyong Du
DOI: https://doi.org/10.1007/978-3-540-30075-5_37
2005-01-01
Journal of Software
Abstract: In this paper, we examine how to process top-k queries efficiently in pure P2P networks. First, we propose a distributed algorithm that evaluates a top-k query hierarchically. Each peer runs the top-k query locally, and the top-k results from different peers are merged level by level from the bottom up, with the query initiator as the root peer. Ranking and merging are thus distributed across the peers to exploit the computing resources in the network. Second, to improve performance, each peer maintains histograms built from the top-k results returned by other peers. The histograms are used to estimate an upper bound on the scores each peer can contribute, so that a query needs to be forwarded only to the most promising neighbouring peers. Our experimental study shows that the top-k query algorithm improves query effectiveness, while the use of histograms enhances query efficiency.
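The two ideas in the abstract, bottom-up merging and bound-based forwarding, can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the `Peer` class, the `top_k` function, and the `upper_bounds` dictionary are hypothetical names, the network is assumed to be a tree rooted at the query initiator, and a single remembered maximum per child stands in for the paper's histograms.

```python
import heapq
from dataclasses import dataclass, field


@dataclass
class Peer:
    name: str
    scores: list                                   # local item scores, higher is better
    children: list = field(default_factory=list)   # neighbouring peers the query is forwarded to
    # Assumption: one remembered score per child replaces the paper's histograms.
    upper_bounds: dict = field(default_factory=dict)


def top_k(peer, k, visited=None):
    """Return the k highest scores in peer's subtree, merged bottom-up."""
    if visited is None:
        visited = []
    visited.append(peer.name)

    # Step 1: local top-k at this peer.
    best = heapq.nlargest(k, peer.scores)

    # Step 2: contact the most promising children first (unknown bound = infinity).
    def bound_of(child):
        return peer.upper_bounds.get(child.name, float("inf"))

    for child in sorted(peer.children, key=bound_of, reverse=True):
        # Skip a child whose estimated best score cannot beat the current k-th score.
        if len(best) == k and bound_of(child) <= best[-1]:
            continue
        child_result = top_k(child, k, visited)
        # Remember the best score this child returned, for later queries.
        peer.upper_bounds[child.name] = max(child_result, default=float("-inf"))
        # Step 3: merge the child's top-k into the running top-k.
        best = heapq.nlargest(k, best + child_result)
    return best


if __name__ == "__main__":
    c = Peer("c", [8, 2])
    a = Peer("a", [9, 3], children=[c])
    b = Peer("b", [4, 0])
    root = Peer("root", [5, 1], children=[a, b])

    first_visit, second_visit = [], []
    print(top_k(root, 3, first_visit), first_visit)
    print(top_k(root, 3, second_visit), second_visit)
```

In this sketch the first query has no bounds and reaches every peer; the second query reuses the remembered bounds and skips peer `b`, whose best known score cannot enter the top 3. The remembered maximum is only safe while the data stays unchanged, which is one reason a real system would need richer statistics such as the histograms the paper describes.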