DUET: A Generic Framework for Finding Special Quadratic Elements in Data Streams

Jiaqian Liu, Haipeng Dai, Rui Xia, Meng Li, Ran Ben Basat, Rui Li, Rong Gu, Jiaqi Zheng, Guihai Chen
DOI: https://doi.org/10.1145/3485447.3512019
2022-01-01
Abstract: Finding special items, such as heavy hitters, top-k items, and persistent items, has long been an active topic in data stream processing for web analysis. Although data streams today are usually high-dimensional, most prior work finds special items along a single primary dimension and yields little insight into the correlations between dimensions. We therefore propose finding special quadratic elements to reveal close correlations. Building on the item types above, we extend the problem to three applications, related to heavy hitters, top-k, and persistent items, and design a generic framework, DUET, to process them. We also analyze the error bound of our algorithm and conduct extensive experiments on four data sets. The experimental results show that DUET can achieve 3.5 times higher throughput and three orders of magnitude lower average relative error than cutting-edge algorithms.