An Approximate Duplicate-Elimination In Rfid Data Streams Based On D-Left Time Bloom Filter

Xiujun Wang,Yusheng Ji,Baohua Zhao
DOI: https://doi.org/10.1007/978-3-319-11116-2_36
2014-01-01
Abstract:There are a larger number of duplicates in RFID data streams, due to the multiple readings of an RFID tag by one RFID reader or by some RFID readers deployed to the same region in an RFID based system. Existing duplicate-elimination methods based on Time Bloom filter (TBF) require multiple counters to store the detected time of an element in RFID data streams, thus waste valuable memory resources. In this paper, we devise d-left Time Bloom filter (DLTBF) as an extension of d-left Counting Bloom filter. With the d-left hashing, a balanced allocation mechanism, DLTBF can store the detected time of an element into one counter. Then we propose an one-pass approximate method to remove duplicates in RFID data streams based on DLTBF. In an RFID data stream, suppose that the detected time of an element is T-bit, i.e., T bits are required to store the detected time of an element in RFID data streams, the number of non-duplicate elements within a time length of tau is W and the probability that a non-duplicate element is taken to be a duplicate by our method is epsilon(the false positive probability), then the number of bits used by our method is O(W log(2) (1/epsilon) vertical bar WT). Experimental results on the synthetic data verify the effectiveness of our method.
What problem does this paper attempt to address?