A Generic Framework for Top-<formula Formulatype="inline"><tex Notation="TeX">${\schmi K}$</tex></formula> Pairs and Top- <formula Formulatype="inline"><tex Notation="TeX">${\schmi K}$</tex></formula> Objects Queries over Sliding Windows

Zhitao Shen,Muhammad Aamir Cheema,Xuemin Lin,Wenjie Zhang,Haixun Wang
DOI: https://doi.org/10.1109/tkde.2012.181
IF: 9.235
2014-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Top-k pairs and top-k objects queries have received significant attention by the research community. In this paper, we present the first approach to answer a broad class of top-k pairs and top-k objects queries over sliding windows. Our framework handles multiple top-k queries and each query is allowed to use a different scoring function, a different value of k, and a different size of the sliding window. Furthermore, the framework allows the users to define arbitrarily complex scoring functions and supports out-of-order data streams. For all the queries that use the same scoring function, we need to maintain only one K-skyband. We present efficient techniques for the K-skyband maintenance and query answering. We conduct a detailed complexity analysis and show that the expected cost of our approach is reasonably close to the lower bound cost. For top-k pairs queries, we demonstrate the efficiency of our approach by comparing it with a specially designed supreme algorithm that assumes the existence of an oracle and meets the lower bound cost. For top-k objects queries, our experimental results demonstrate the superiority of our algorithm over the state-of-the-art algorithm.
What problem does this paper attempt to address?