A Basic-Window Based Priority-Sample Algorithm For Sliding Windows Over Data Streams

Longbo Zhang, Zhanhuai Li, Min Yu, Yun Jiang
2007-01-01
Abstract: This paper introduces the problem of random sampling from time-based sliding windows over data streams and presents a basic-window based priority-sample (BWPS) algorithm for it. In the BWPS algorithm, a key is calculated for each data item in the time-based sliding window, and data items with larger keys enter the sample, replacing data items with smaller keys. The algorithm extends the priority-sampling (PS) algorithm, achieving a significant improvement in efficiency while keeping storage requirements similar to those of the PS algorithm. It works well even when the number of data items in a sliding window varies dynamically over time. Experiments show that the BWPS algorithm is effective and efficient for processing data items from sliding windows over data streams.
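The abstract gives no pseudocode, so the following is only a minimal Python sketch of the general idea it describes: each item receives a key, and items with larger keys displace items with smaller keys in the sample. Everything specific here is an assumption made for illustration rather than the paper's actual BWPS procedure: the class name `BasicWindowPrioritySample`, the parameters `basic_len`, `num_basic`, and `k`, the use of uniform random keys, the requirement that timestamps arrive in non-decreasing order, and the choice to expire data one whole basic window at a time.

```python
import heapq
import random
from collections import OrderedDict


class BasicWindowPrioritySample:
    """Illustrative sketch only; not the paper's exact BWPS algorithm.

    Assumptions: the sliding window spans `num_basic` basic windows of
    `basic_len` time units each, expiry happens one whole basic window
    at a time, keys are uniform random numbers, and timestamps are
    non-decreasing.
    """

    def __init__(self, basic_len, num_basic, k, rng=None):
        self.basic_len = basic_len
        self.num_basic = num_basic
        self.k = k
        self.rng = rng if rng is not None else random.Random()
        # basic-window index -> min-heap of (key, seq, item), at most k entries
        self.buckets = OrderedDict()
        self.seq = 0  # unique tie-breaker so items are never compared

    def _expire(self, now):
        """Drop basic windows that have slid out of the window."""
        current = int(now // self.basic_len)
        oldest_live = current - self.num_basic + 1
        while self.buckets and next(iter(self.buckets)) < oldest_live:
            self.buckets.popitem(last=False)

    def add(self, timestamp, item):
        """Assign a random key to the item and keep it if the key is large enough."""
        self._expire(timestamp)
        idx = int(timestamp // self.basic_len)
        heap = self.buckets.setdefault(idx, [])
        entry = (self.rng.random(), self.seq, item)
        self.seq += 1
        if len(heap) < self.k:
            heapq.heappush(heap, entry)
        elif entry[0] > heap[0][0]:
            # The new key beats the smallest retained key: replace it.
            heapq.heapreplace(heap, entry)

    def sample(self, now):
        """Return up to k items holding the largest keys in the live window."""
        self._expire(now)
        candidates = [e for heap in self.buckets.values() for e in heap]
        return [item for _, _, item in heapq.nlargest(self.k, candidates)]
```

Under these assumptions, any item among the k largest keys of the whole window is also among the k largest keys of its own basic window, so merging the per-basic-window candidates loses nothing, and memory stays bounded by `k * num_basic` entries however many items arrive per window.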