Split block Bloom filters

Jim Apple
DOI: https://doi.org/10.48550/arXiv.2101.01719
2023-01-21
Abstract: This short note describes a Bloom filter variant that takes advantage of modern SIMD instructions to increase speed by 30%-450%. This filter, the split block Bloom filter, is used by StarRocks, Apache Impala, Apache Kudu, Apache Parquet, Apache Arrow, Apache Drill, and Alibaba Cloud's Hologres.
Data Structures and Algorithms
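
The abstract names the split block Bloom filter without showing its layout, so a minimal scalar sketch may help fix ideas. It assumes the block layout described in the Apache Parquet format's Bloom filter documentation: each block is eight 32-bit words, a key sets exactly one bit in each word, and the upper half of a 64-bit hash picks the block. The class name, method names, and the `hash64` helper are hypothetical, and BLAKE2b is only a stand-in hash chosen because it ships with Python. The salt constants are taken to match the Parquet description and should be treated as assumptions here. This is plain Python with no SIMD, so it illustrates the structure, not the speedup the note reports.

```python
import hashlib

# Eight odd 32-bit multipliers, one per word of a block.
# Assumption: these match the constants in the Apache Parquet Bloom filter description.
SALT = (0x47B6137B, 0x44974D91, 0x8824AD5B, 0xA2B7289D,
        0x705495C7, 0x2DF1424B, 0x9EFC4947, 0x5C6BFB31)


def hash64(key: bytes) -> int:
    """Hypothetical stand-in 64-bit hash (BLAKE2b truncated to 8 bytes)."""
    return int.from_bytes(hashlib.blake2b(key, digest_size=8).digest(), "little")


class SplitBlockBloomFilter:
    """Scalar sketch: each block is 8 words of 32 bits (256 bits total)."""

    def __init__(self, num_blocks: int) -> None:
        if num_blocks < 1:
            raise ValueError("num_blocks must be at least 1")
        self.num_blocks = num_blocks
        self.blocks = [[0] * 8 for _ in range(num_blocks)]

    def _block_index(self, h: int) -> int:
        # Map the upper 32 bits of the hash onto [0, num_blocks) without a modulo.
        return ((h >> 32) * self.num_blocks) >> 32

    @staticmethod
    def _mask(h: int) -> list:
        # The lower 32 bits choose one bit position (0..31) in each of the 8 words.
        x = h & 0xFFFFFFFF
        return [1 << (((x * s) & 0xFFFFFFFF) >> 27) for s in SALT]

    def insert(self, h: int) -> None:
        block = self.blocks[self._block_index(h)]
        for i, bit in enumerate(self._mask(h)):
            block[i] |= bit

    def might_contain(self, h: int) -> bool:
        block = self.blocks[self._block_index(h)]
        return all(block[i] & bit for i, bit in enumerate(self._mask(h)))


if __name__ == "__main__":
    bloom = SplitBlockBloomFilter(num_blocks=64)
    for word in (b"impala", b"kudu", b"parquet"):
        bloom.insert(hash64(word))
    print(bloom.might_contain(hash64(b"kudu")))
```

Because every lookup touches a single 256-bit block, the eight word-level operations in `insert` and `might_contain` are the part a SIMD implementation could carry out together on one vector register.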