FP-Growth Algorithm Based on Boolean Matrix and MapReduce

Xing-shu Chen, Shuai Zhang, Hao Tong, Xiao-jing Cui
DOI: https://doi.org/10.3969/j.issn.1000-565X.2014.01.023
2014-01-01
Abstract: Association rule mining is an important part of data mining. To improve the efficiency of association rule mining, an FP-Growth algorithm based on a Boolean matrix and MapReduce, denoted BPFP, is proposed, and its time and space complexity are analyzed. In the BPFP algorithm, the Hadoop framework and a Boolean matrix are used to reduce the number of scans of the transaction data, and two MapReduce passes are adopted to mine frequent itemsets. Experimental results on multiple data sets show that the improved FP-Growth algorithm outperforms the original one in execution efficiency and speedup.
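The abstract does not spell out the matrix layout or the two MapReduce passes, so the following is only a minimal single-machine sketch of the general idea of a Boolean transaction matrix, not the BPFP algorithm itself. It assumes one row per transaction and one column per item; once the matrix is built from a single scan, the support of any itemset can be counted from the matrix without rescanning the raw data. The function names `build_boolean_matrix` and `support` and the sample transactions are hypothetical.

```python
from typing import Iterable, List, Sequence, Tuple


def build_boolean_matrix(
    transactions: Sequence[Iterable[str]],
) -> Tuple[List[str], List[List[bool]]]:
    """Scan the transactions once and encode them as a Boolean matrix.

    Assumed layout: matrix[r][c] is True when transaction r contains items[c].
    """
    rows = [set(t) for t in transactions]
    items = sorted(set().union(*rows)) if rows else []
    matrix = [[item in row for item in items] for row in rows]
    return items, matrix


def support(items: List[str], matrix: List[List[bool]], itemset: Iterable[str]) -> int:
    """Count the rows in which every column of the itemset is True."""
    cols = [items.index(i) for i in itemset]
    return sum(1 for row in matrix if all(row[c] for c in cols))


# Hypothetical sample data, for illustration only.
transactions = [
    ["a", "b", "c"],
    ["a", "c"],
    ["b", "d"],
    ["a", "b", "c", "d"],
]
items, matrix = build_boolean_matrix(transactions)
print(support(items, matrix, {"a", "c"}))  # 3
print(support(items, matrix, {"a", "d"}))  # 1
```

In a distributed setting, the row-wise counting in `support` is the kind of work that could be split across mappers and summed in reducers, but how BPFP actually partitions it is not stated in the abstract.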