An Algorithm of Association Rules Mining in Large Databases Based on Sampling

Zhi Liu,Tianhong Sun,Guoming Sang
DOI: https://doi.org/10.14257/ijdta.2013.6.6.09
2013-01-01
International Journal of Database Theory and Application
Abstract:In recent years, the amount of data into a geometric growth puts forward higher requirements on data mining algorithm.In the process of frequent itemsets of traditional Apriori algorithm produced, frequent itemsets' generation and storage are quite a waste of time and space.In this paper, we put forward a new Hash table and use the technology to improve the algorithm and get SamplingHT algorithm, through a lot of contrast experiments showed that the new algorithm enhances performance when frequent itemset is generated, and effectively reduce the database scan times, In order to achieve more optima.
What problem does this paper attempt to address?