A Compact FP-Tree and Array-Technique Based Algorithm for Frequent Patterns Mining

Qin Liangxi,Su Yongxiu,Liu Yongbin,Liang Bizhen
2008-01-01
Journal of Computer Research and Development
Abstract:FP-growth algorithm is an efficient algorithm for mining frequent patterns. It scans database only twice and does not need to generate and test the candidate sets that is quite time consuming. The efficiency of the FP-growth algorithm outperforms previously developed algorithms. But, it must recursively generate huge number of conditional FP-trees that requires huge volume of memory and costs a lot of time.In this paper, it is presented an algorithm, CFPmine, which is inspired by several previous works. CFPmine algorithm combines several advantages of existing techniques. One is using constrained subtrees of a compact FP-tree to mine frequent pattern, so that it is doesn't need to construct conditional FP-trees in the mining process and reduces the memory consume. Second is using an array-based technique to reduce the traverse time to the CFP-tree. And a unified memory management is also implemented in the algorithm. The experimental evaluation shows that CFPmine algorithm is a high performance algorithm. It outperforms Apriori, Eclat and FP-growth and requires less memory than FP-growth.
What problem does this paper attempt to address?