A New Algorithm for Mining Association Rules Based on Dataset Division

YANG Cui-Ming,LIU Xi-Pin,XIONG Gao-Feng,LUO Long-Fu
2007-01-01
Abstract:Fp-growth algorithm has disadvantages such as lower space utilization rate and slower execution time when mining the large datasets.To overcome these drawbacks,based on the Fp-growth algorithm,this paper proposed a new algorithm for mining association rules from large datasets.The algorithm adopts a new strategy to divide the large datasets into many subsets,and then,carries out constrained frequent item sets mining for each subset.Experiments have been conducted to compare the proposed algorithm with the Fp-growth algorithm.Experimental results show that the algorithm has lower memory usage,and is faster than the Fp-growth algorithm when the datasets is very large.
What problem does this paper attempt to address?