New Sampling-Based Summary Structures for Sliding Windows over Data Streams

Longbo Zhang,Zhanhuai Li,Min Yu,Guangyuan Zhao
DOI: https://doi.org/10.1007/978-3-540-74282-1_140
2007-01-01
Abstract:The main focus in algorithms has been on efficient construction of summary structures for data streams. This paper introduces the problem of construction of summary structures from sliding windows over data streams, and presents a new sampling-based summary structure and new techniques for its fast incremental maintenance. When a new data item v i arrives, a key k i is calculated and a random number X i is generated. The key k i is used to determine if v i will be selected to enter the sample, and X i is used to determine how many data items will be skipped over. The experiments show that the new algorithm is effective and efficient for construction of summary structures from sliding windows over data streams.
What problem does this paper attempt to address?