SketchConf: A Framework for Automatic Sketch Configuration.

Ruijie Miao,Fenghao Dong,Yikai Zhao,Yiming Zhao,Yuhan Wu,Kaicheng Yang,Tong Yang,Bin Cui
DOI: https://doi.org/10.1109/icde55515.2023.00157
2023-01-01
Abstract:Sketches have risen as promising solutions for frequency estimation, which is one of the most fundamental tasks in approximate data stream processing. In many scenarios, users have a strong demand to apply sketches under the expected error constraints. In this paper, we explore how to configure sketch parameters to satisfy user-defined error constraints. We propose SketchConf, an automatic sketch configuration framework, which efficiently generates memory-optimal configurations for the first time. We show that SketchConf can be applied to order-independent sketches, including CM, Count, Tower, and Nitro sketches. We further discuss how to deal with the unknown and changeable workloads when applying SketchConf to the real scenarios of streaming data processing. Experimental results show that SketchConf can be up to 715.51 times faster than the baseline algorithm, and the outputted configurations save up to 99.99% memory and achieve up to 27.44 times throughput, compared with the theory-based configurations. The code is open sourced at Github.
What problem does this paper attempt to address?