Strategies of Efficiency Improvement for Eclat Algorithm

冯培恩,刘屿,邱清盈,李立新
DOI: https://doi.org/10.3785/j.issn.1008-973x.2013.02.005
2013-01-01
Abstract:For the purpose of efficiency improvement,Eclat algorithm was optimized in three aspects-pruning,itemsets connection and intersection.Firstly,the equivalence classes were divided in the suffix-based way to make the best of pruning in which a double layer hash table was utilized to accelerate the search process of subsets of candidate itemsets.Secondly,a partition list of the set of itemsets was presented to eliminate the connection judgment of itemsets.Finally,a transaction id(Tid) lost threshold was introduced to speed up intersection.Based on the above three improvement strategies an Eclat_opt algorithm was proposed.The performance comparison between the Eclat_opt algorithm,the original Eclat algorithm(ZAKI) and two other improved Eclat algorithms Diffset(ZAKI),hEclat(XIONG Zhong-yang) showed that the efficiency of the Eclat_opt algorithm ranked the first among the four algorithms on sparse datasets,and its overall time performance was the best.
What problem does this paper attempt to address?