A Unified Framework for Answering k Closest Pairs Queries and Variants

muhammad aamir cheema,xuemin lin,haixun wang,jianmin wang,wenjie zhang
DOI: https://doi.org/10.1109/TKDE.2014.2304469
2014-01-01
Abstract:Given a scoring function that computes the score of a pair of objects, a top- pairs query returns pairs with the smallest scores. In this paper, we present a unified framework for answering generic top- pairs queries including -closest pairs queries, -furthest pairs queries and their variants. Note that -closest pairs query is a special case of top- pairs queries where the scoring function is the distance between the two objects in a pair. We are the first to present a unified framework to efficiently answer a broad class of top- queries including the queries mentioned above. We present efficient algorithms and provide a detailed theoretical analysis that demonstrates that the expected performance of our proposed algorithms is optimal for two dimensional data sets. Furthermore, our framework does not require pre-built indexes, uses limited main memory and is easy to implement. We - lso extend our techniques to support top- pairs queries on multi-valued (or uncertain) objects. We also demonstrate that our framework can handle exclusive top- pairs queries. Our extensive experimental study demonstrates effectiveness and efficiency of our proposed techniques.
What problem does this paper attempt to address?