REncoder: A Space-Time Efficient Range Filter with Local Encoder.

Ziwei Wang, Zheng Zhong, Jiarui Guo, Yuhan Wu, Haoyu Li, Tong Yang, Yaofeng Tu, Huanchen Zhang, Bin Cui
DOI: https://doi.org/10.1109/icde55515.2023.00158
2023-01-01
Abstract: A range filter is a data structure for answering range membership queries. Range queries are common in modern applications, and range filters have gained increasing attention because they improve range query performance by ruling out empty range queries. However, state-of-the-art range filters, such as SuRF and Rosetta, suffer from either a high false positive rate or low throughput. In this paper, we propose a novel range filter called REncoder. It organizes all prefixes of the keys into a segment tree and locally encodes the segment tree into a Bloom filter to accelerate queries. REncoder supports diverse workloads by adaptively choosing how many levels of the segment tree to store. We theoretically prove that the error of REncoder is bounded and derive the asymptotic space complexity under the bounded error. We conduct extensive experiments on both synthetic and real datasets. The experimental results show that REncoder outperforms all state-of-the-art range filters.
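To make the idea of answering range membership queries from key prefixes concrete, the following is a minimal sketch, not the authors' implementation. It stores every binary prefix of every key (that is, every segment tree node on the path to a key) in a plain Bloom filter and answers a range query by walking the implicit segment tree. It does not reproduce REncoder's local encoding or its adaptive choice of stored levels. All names (`BloomFilter`, `PrefixRangeFilter`, `may_contain_range`) and parameters (`key_bits`, `num_bits`, `num_hashes`) are hypothetical and chosen only for illustration.

```python
import hashlib


class BloomFilter:
    """A plain Bloom filter over byte strings (illustrative, not tuned)."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting BLAKE2b with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.blake2b(
                item, digest_size=8, salt=i.to_bytes(8, "little")
            ).digest()
            yield int.from_bytes(digest, "little") % self.num_bits

    def add(self, item):
        for p in self._positions(item):
            self.bits[p >> 3] |= 1 << (p & 7)

    def __contains__(self, item):
        return all(self.bits[p >> 3] & (1 << (p & 7)) for p in self._positions(item))


class PrefixRangeFilter:
    """Range filter that stores all key prefixes in one Bloom filter.

    Assumption: keys are unsigned integers that fit in key_bits bits.
    """

    def __init__(self, keys, key_bits=16, num_bits=1 << 16, num_hashes=3):
        self.key_bits = key_bits
        self.bloom = BloomFilter(num_bits, num_hashes)
        for key in keys:
            # A prefix of a given length identifies one node of the segment tree.
            for length in range(1, key_bits + 1):
                self.bloom.add(self._encode(key >> (key_bits - length), length))

    @staticmethod
    def _encode(prefix, length):
        # Include the length so equal prefix values on different levels differ.
        return length.to_bytes(1, "little") + prefix.to_bytes(8, "little")

    def may_contain_range(self, lo, hi):
        """Return False only if no stored key lies in [lo, hi]."""
        return self._query(0, 0, lo, hi)

    def _query(self, prefix, length, lo, hi):
        shift = self.key_bits - length
        node_lo = prefix << shift
        node_hi = node_lo + (1 << shift) - 1
        if hi < node_lo or node_hi < lo:
            return False  # node does not overlap the query range
        if length > 0 and self._encode(prefix, length) not in self.bloom:
            return False  # no key under this node, so prune the subtree
        if length == self.key_bits:
            return True  # reached a leaf that the filter reports as present
        # Keep descending even when the node is fully covered, so that a
        # false positive on an inner node can still be ruled out lower down.
        return self._query(prefix << 1, length + 1, lo, hi) or self._query(
            (prefix << 1) | 1, length + 1, lo, hi
        )
```

For example, after `f = PrefixRangeFilter([100, 2000, 40000])`, the call `f.may_contain_range(90, 110)` returns `True` because the filter has no false negatives, while a query over an empty range such as `f.may_contain_range(3000, 3900)` returns `False` unless a Bloom filter false positive occurs. Each query in this sketch issues one independent Bloom filter probe per visited tree node; the abstract indicates that REncoder instead encodes the segment tree locally to accelerate queries.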