Efficient Top-K Query Algorithms Using K-Skyband Partition

Zhenqiang Gong, Guang-Zhong Sun, Jing Yuan, Yanjing Zhong
DOI: https://doi.org/10.1007/978-3-642-10485-5_21
2012-01-01
Abstract: Efficient processing of top-k queries has become a classical research area. Fagin et al. proposed "middleware cost" as a cost measure for top-k query algorithms. In some scenarios there is no way to perform random accesses, and Fagin et al. proposed the NRA (No Random Access) algorithm for that case. In this paper, we investigate the intrinsic relation between top-k queries and K-skyband queries. Based on that relation, we propose a novel algorithm, DNRA (Dominate-NRA). The main idea of DNRA is to partition the original dataset into two sub-datasets, depending on whether each object belongs to the K-skyband or not. We prove that DNRA performs no more sorted accesses than NRA on any dataset. Furthermore, we partition the dataset into N sub-datasets (where N is the number of objects in the dataset) and propose a second algorithm, ADNRA (Advanced-DNRA). The partition of the dataset is pre-computed, and we discuss two techniques for computing it. Extensive experiments show that our algorithms perform several orders of magnitude fewer accesses than NRA, and that ADNRA performs significantly fewer accesses than DNRA on some datasets.
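The relation between top-k queries and the K-skyband that the abstract builds on can be illustrated with a small sketch. This is not the authors' DNRA or ADNRA; it is a brute-force illustration under stated assumptions: higher attribute values are better, the scoring function is monotone (here a plain sum), and the K-skyband is taken in its usual sense as the set of objects dominated by fewer than K other objects. The function names `dominates`, `k_skyband`, and `top_k` and the sample data are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Point, b: Point) -> bool:
    """True if a is at least as good as b everywhere and strictly better somewhere."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def k_skyband(points: Sequence[Point], K: int) -> List[int]:
    """Indices of points dominated by fewer than K other points (O(n^2) scan)."""
    band = []
    for i, p in enumerate(points):
        dominated_by = sum(
            1 for j, q in enumerate(points) if j != i and dominates(q, p)
        )
        if dominated_by < K:
            band.append(i)
    return band


def top_k(points: Sequence[Point], candidates: Sequence[int], k: int) -> List[int]:
    """Top-k candidate indices under a monotone score (sum), ties broken by index."""
    return sorted(candidates, key=lambda i: (-sum(points[i]), i))[:k]


# Hypothetical sample data: six objects with two attributes each.
points = [(0.9, 0.8), (0.7, 0.9), (0.5, 0.5), (0.3, 0.4), (0.95, 0.1), (0.2, 0.2)]

band = k_skyband(points, K=2)                  # pre-computable partition
full = top_k(points, range(len(points)), k=2)  # query over the whole dataset
pruned = top_k(points, band, k=2)              # query over the 2-skyband only
```

In this sample the query over the 2-skyband returns the same objects as the query over the full dataset, which is the property that makes a pre-computed skyband partition useful: for k no larger than K, objects outside the K-skyband need not be examined.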