An Efficient Algorithm For Frequent Pattern Mining Based On Privacy-Preserving

Yaling Zhang, Ting Wang, Shangping Wang
DOI: https://doi.org/10.1109/FSKD.2017.8393021
2017-01-01
Abstract: With the arrival of the big data era, mining efficiency has become the bottleneck of privacy-preserving frequent pattern mining algorithms. The privacy-preserving data mining algorithm BEMASK spends considerable time and system memory on bitwise AND operations and on counting a large number of subsets when there are tens of thousands of candidate sets. To address this problem, the PBEK algorithm is proposed, based on partitioning and a probability-based prior constraint. In the new algorithm, the bitmaps are divided into several homogeneous overlapping pieces, and the BEMASK algorithm is run on each small piece. The probability-based prior constraint reduces the number of local frequent itemsets, and the global frequent itemsets are then generated efficiently by scanning the global bitmaps. Experimental results show that the improved algorithm is more efficient than the BEMASK algorithm.
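The partition-then-verify idea described in the abstract can be illustrated with a minimal sketch. This is not the PBEK or BEMASK algorithm itself: it omits the privacy-preserving distortion and the probability-based prior constraint, uses disjoint pieces rather than overlapping ones, and enumerates itemsets by brute force. All function names and parameters (`build_bitmaps`, `support`, `partitioned_mine`, `piece_size`, and so on) are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def build_bitmaps(transactions, items):
    """One integer per item; bit t is set when transaction t contains the item."""
    bitmaps = {item: 0 for item in items}
    for t, transaction in enumerate(transactions):
        for item in transaction:
            bitmaps[item] |= 1 << t
    return bitmaps


def support(itemset, bitmaps, n):
    """Count transactions containing every item: AND the bitmaps, then count set bits."""
    acc = (1 << n) - 1
    for item in itemset:
        acc &= bitmaps[item]
    return bin(acc).count("1")


def frequent_itemsets(transactions, items, min_ratio, max_size):
    """Brute-force stand-in for the per-piece miner (assumption: not BEMASK)."""
    n = len(transactions)
    bitmaps = build_bitmaps(transactions, items)
    found = set()
    for k in range(1, max_size + 1):
        for combo in combinations(sorted(items), k):
            if support(combo, bitmaps, n) >= min_ratio * n:
                found.add(combo)
    return found


def partitioned_mine(transactions, items, min_ratio, max_size, piece_size):
    """Mine each piece locally, then verify the merged candidates globally."""
    candidates = set()
    for start in range(0, len(transactions), piece_size):
        piece = transactions[start:start + piece_size]
        candidates |= frequent_itemsets(piece, items, min_ratio, max_size)

    # A single pass over the global bitmaps keeps only truly frequent itemsets.
    n = len(transactions)
    global_bitmaps = build_bitmaps(transactions, items)
    result = {}
    for candidate in candidates:
        count = support(candidate, global_bitmaps, n)
        if count >= min_ratio * n:
            result[candidate] = count
    return result


transactions = [{"a", "b"}, {"a", "b", "c"}, {"a"}, {"b", "c"}, {"a", "b"}, {"c"}]
result = partitioned_mine(transactions, {"a", "b", "c"}, 0.5, 2, 3)
print(result)
```

With disjoint pieces and a relative support threshold, any globally frequent itemset is frequent in at least one piece, so the global verification pass only has to check the merged local candidates rather than every possible subset.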