Algorithm Based on Sliding Window for Similarity Queries over Data Stream

WANG Kao-jie,ZHENG Xue-feng,Song Yi-ding
DOI: https://doi.org/10.3969/j.issn.1002-137x.2010.10.039
2010-01-01
Abstract:Similarity queries are fundamental part of modern data mining application.But traditional query algorithms can not be applied on data stream,which is an unbounded sequence of data elements generated at a rapid rate.We proposed a novel approach for computing similarity over multi data streams based on wavelet sliding window model.The basic idea is to divide sliding window into equally-sized basic windows and represent the data elements of a basic window using wavelet coefficients,then form wavelet synopses window.As a result,queries toward data streams can be converted to queries toward such wavelet synopses.This algorithm takes advantage of the merit of wavelet decomposition for linear computing and achieves superior runtime performance.The extensive experiments verified the effectiveness of our algorithm.
What problem does this paper attempt to address?