A Relatively Effective and Practical Load Shedding Strategy for Sliding-Window Join Queries over Data Streams

Longbo Zhang, Zhanhuai Li, Liping Zhu, Jiangtao Liu, Yiqiang Zhao
DOI: https://doi.org/10.3969/j.issn.1000-2758.2006.05.014
2006-01-01
Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
Abstract: Aim. The strategies of Refs. 2-5 appear, in our opinion, to be still not quite effective and practical for shedding load from sliding-window join queries over data streams. We now present a new strategy that performs relatively better. The full paper explains the new strategy in detail; this abstract only lists the three topics of that explanation with some pertinent remarks: (A) problem description; (B) sliding-window join queries, where Fig. 1 in the full paper is a schematic, taken from Ref. 2, of the three operations (insert, probe and invalidate); (C) a load shedding strategy based on partitioning the domain of the join attributes, where we derive Eqs. (1) and (2) and where Fig. 2 in the full paper is a schematic showing how to execute the strategy with two operator modules X_1 and X_2. In essence, the domain of the join attributes is partitioned into sub-domains, and tuples are dropped according to their join values by maintaining simple data stream statistics. We performed two experiments: experiment 1 examines the effect of different skew parameters of the Zipf distribution; experiment 2 examines the effect of different degrees of overload. The results are shown in Figs. 3 and 4 in the full paper. The new strategy needs fewer statistics about the input data streams and makes further processing of the join output convenient. It also adapts well to different Zipf skew parameters and different peak loads. The theoretical analysis and the experiments show preliminarily that the new load shedding strategy is effective and efficient for window join queries.
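The idea of dropping tuples by sub-domain of the join attribute can be illustrated with a minimal Python sketch. It is not the paper's algorithm: Eqs. (1) and (2) are not reproduced in the abstract, so the equal-width partition, the per-sub-domain tuple counts, the product-of-counts productivity estimate, and the names `subdomain`, `choose_kept_subdomains` and `keep_fraction` are all assumptions made for illustration only.

```python
from collections import Counter


def subdomain(value, lo, hi, k):
    """Map a join value in [lo, hi) to one of k equal-width sub-domains.

    Equal-width partitioning is an assumption; the paper may partition differently.
    """
    width = (hi - lo) / k
    return min(k - 1, max(0, int((value - lo) / width)))


def choose_kept_subdomains(counts_r, counts_s, k, keep_fraction):
    """Pick the sub-domains whose tuples are kept (hypothetical heuristic).

    counts_r / counts_s are Counters of tuples per sub-domain for the two
    streams. Sub-domains are ranked by estimated join output (product of the
    two counts) and kept greedily until the next one would exceed the budget.
    """
    total = sum(counts_r.values()) + sum(counts_s.values())
    budget = keep_fraction * total
    ranked = sorted(range(k), key=lambda d: counts_r[d] * counts_s[d], reverse=True)
    kept, used = set(), 0
    for d in ranked:
        cost = counts_r[d] + counts_s[d]
        if used + cost > budget:
            break
        kept.add(d)
        used += cost
    return kept


def shed(values, kept, lo, hi, k):
    """Drop every tuple whose join value falls outside the kept sub-domains."""
    return [v for v in values if subdomain(v, lo, hi, k) in kept]


# Toy example: two small streams over the join domain [0, 100), 4 sub-domains.
LO, HI, K = 0, 100, 4
r_values = [1, 2, 3, 3, 60, 90]
s_values = [2, 3, 3, 5, 61, 30]
counts_r = Counter(subdomain(v, LO, HI, K) for v in r_values)
counts_s = Counter(subdomain(v, LO, HI, K) for v in s_values)
kept = choose_kept_subdomains(counts_r, counts_s, K, keep_fraction=0.7)
r_kept = shed(r_values, kept, LO, HI, K)
s_kept = shed(s_values, kept, LO, HI, K)
```

In this sketch both streams are filtered with the same set of kept sub-domains, so every join result whose value lies in a kept sub-domain is still produced; only the per-sub-domain counters need to be maintained as statistics.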