Pruning Strategy for Mining Maximal Frequent Itemsets

MA Zhixin,CHEN Xiaoyun,WANG Xue,LI Longjie
DOI: https://doi.org/10.3321/j.issn:1000-0054.2005.09.006
2005-01-01
Abstract:Mining maximal frequent itemsets is a fundamental problem in many practical web mining applications. This paper presents ESEquivPS extension support equivalency pruning strategy, a new search space pruning strategy for mining maximal frequent itemsets to effectively reduce the search space. ESEquivPS was based on a depthfirst traversal of lexicographic subset enumeration tree and uses equivalency of item's extension supports to prune search space. Furthermore, the MAFIA maximal frequent itemset algorithm was improved by using ESEquivPS. The experimental results show that ESEquivPS can efficiently reduce the search space. Especially on sparse dataset with longer itemsets, the size of search space can be trimmed off by 2/3 and new algorithm runs around three to five times faster than previous MAFIA algorithm.
What problem does this paper attempt to address?