An Algorithm of Mining Frequent Itemsets Based on Bloom Filter

林海
2009-01-01
Abstract: Mining maximum frequent itemsets is a key problem in data mining. To address the scalability problem in mining frequent itemsets, an algorithm called Mining Top-K is proposed, based on the theory of the Bloom filter. It not only mines the K most frequent elements but also circumvents the scalability problem of mining frequent itemsets. In particular, by applying the extended Bloom filter, the algorithm can also compute the frequencies of the K most frequent elements. Experiments demonstrate that the algorithm saves space without sacrificing accuracy.
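The abstract does not spell out the Mining Top-K procedure, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes the "extended Bloom filter" behaves like a counting Bloom filter (counters in place of bits) and that a small candidate table of at most K items is kept alongside it. The names `CountingBloomFilter` and `top_k`, the parameters `num_counters` and `num_hashes`, and the use of salted SHA-256 as the hash family are all illustrative choices, not taken from the paper.

```python
import hashlib


class CountingBloomFilter:
    """Bloom filter variant whose cells are counters rather than bits (assumed design)."""

    def __init__(self, num_counters=1024, num_hashes=4):
        self.num_counters = num_counters
        self.num_hashes = num_hashes
        self.counters = [0] * num_counters

    def _positions(self, item):
        # Derive num_hashes counter positions by salting one hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_counters

    def add(self, item):
        for pos in self._positions(item):
            self.counters[pos] += 1

    def estimate(self, item):
        # Collisions can only inflate counters, so the minimum over the
        # item's positions never underestimates its true frequency.
        return min(self.counters[pos] for pos in self._positions(item))


def top_k(stream, k, num_counters=1024, num_hashes=4):
    """Return up to k (item, estimated_count) pairs, most frequent first."""
    cbf = CountingBloomFilter(num_counters, num_hashes)
    candidates = {}  # item -> latest estimated count; holds at most k entries
    for item in stream:
        cbf.add(item)
        est = cbf.estimate(item)
        if item in candidates or len(candidates) < k:
            candidates[item] = est
        else:
            # Replace the weakest candidate only if the new item now beats it.
            weakest = min(candidates, key=candidates.get)
            if est > candidates[weakest]:
                del candidates[weakest]
                candidates[item] = est
    return sorted(candidates.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    stream = ["a"] * 5 + ["b"] * 3 + ["c"] + ["d"] * 2
    print(top_k(stream, 2))
```

In this sketch the memory used for counting is fixed by `num_counters` regardless of how many distinct items appear in the stream, which is the kind of space saving the abstract refers to. The estimates are upper bounds on the true frequencies; they become exact when hash collisions are rare relative to the number of counters.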