Space Efficient Quantile Summary for Constrained Sliding Windows on a Data Stream

Jian Xu, Xuemin Lin, Xiaofang Zhou
DOI: https://doi.org/10.1007/978-3-540-27772-9_5
2004-01-01
Abstract: In many online applications, we need to maintain quantile statistics for a sliding window on a data stream. In its natural form, a sliding window is defined as the most recent N data items. In this paper, we study the problem of estimating quantiles over other types of sliding windows. We present a uniform framework to process quantile queries for time-constrained and filter-based sliding windows. Our algorithm makes one pass over the data stream and maintains an ε-approximate summary. It uses O((1/ε²) log² εN) space, where N is the number of data items in the window. We extend this framework to process generalized constrained sliding window queries and prove that our technique is applicable to flexible window settings. Our performance study indicates that the space required in practice is much less than the given theoretical bound and that the algorithm supports high-speed data streams.
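To make the terms in the abstract concrete, here is a minimal Python sketch of a quantile query over a time-constrained sliding window, together with a check of the usual ε-approximation condition (the returned item has some rank r with |r − φN| ≤ εN). It is not the paper's summary structure: it is an exact baseline that stores every item in the window, so it uses O(N) space. The names `TimeWindowExactQuantiles`, `span`, and `is_eps_approximate` are hypothetical and chosen only for illustration.

```python
import math
from bisect import bisect_left, bisect_right
from collections import deque


class TimeWindowExactQuantiles:
    """Exact baseline (assumption: NOT the paper's algorithm).

    Keeps every item whose timestamp lies in (now - span, now].
    """

    def __init__(self, span):
        self.span = span
        self.items = deque()  # (timestamp, value) pairs in arrival order

    def insert(self, timestamp, value):
        # Timestamps are assumed to arrive in non-decreasing order.
        self.items.append((timestamp, value))
        # Drop items that have fallen out of the time window.
        while self.items and self.items[0][0] <= timestamp - self.span:
            self.items.popleft()

    def quantile(self, phi):
        # Return the item of rank ceil(phi * N) among the N items in the window.
        values = sorted(v for _, v in self.items)
        rank = max(1, math.ceil(phi * len(values)))
        return values[rank - 1]


def is_eps_approximate(values, answer, phi, eps):
    """True if `answer` has some rank r in `values` with |r - phi*N| <= eps*N."""
    ordered = sorted(values)
    n = len(ordered)
    lowest_rank = bisect_left(ordered, answer) + 1
    highest_rank = bisect_right(ordered, answer)
    if highest_rank < lowest_rank:  # answer does not occur in values
        return False
    target = phi * n
    return lowest_rank - eps * n <= target <= highest_rank + eps * n


window = TimeWindowExactQuantiles(span=10)
for t in range(1, 21):
    window.insert(t, t)  # value equals timestamp, for easy inspection
median = window.quantile(0.5)
in_window = [v for _, v in window.items]
```

A space-efficient summary such as the one the abstract describes would answer the same `quantile` query from far fewer stored items, and any answer it returns should pass `is_eps_approximate` against the true window contents.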