Seesaw Counting Filter: an Efficient Guardian for Vulnerable Negative Keys During Dynamic Filtering

Meng Li,Deyi Chen,Haipeng Dai,Rongbiao Xie,Siqiang Luo,Rong Gu,Tong Yang,Guihai Chen
DOI: https://doi.org/10.1145/3485447.3511996
2022-01-01
Abstract:Bloom filter is an efficient data structure for filtering negative keys (keys not in a given set) with substantially small space. However, in real-world applications, there widely exist vulnerable negative keys, which will bring high costs if not being properly filtered, especially when positive keys are added/deleted dynamically. Such problem gets more severe when keys within one set are dynamically added or deleted. Recently, there are works focusing on handling such (vulnerable) negative keys by incorporating learning techniques. These learning-based filters fail to work as the learning techniques can hardly handle incremental insertions or deletions. To address the problem, we propose S ee S aw C ounting F ilter (SSCF), which is innovated with encapsulating the vulnerable negative keys into a unified counter array named seesaw counter array, and dynamically modulating (or varying) the applied hash functions to guard the encapsulated keys from being misidentified. Moreover, we design ada-SSCF to handle the scenarios where the vulnerable negative keys cannot be obtained in advance. We extensively evaluate our SSCF, which shows that SSCF outperforms the cutting-edge filters by $3\times$ on averages regarding accuracy while ensuring a low operation latency. All source codes are in (SSCF-authors).
What problem does this paper attempt to address?