Searching of interesting itemsets for negative association rules

Hyeok Kong, Dokjun An, Douk Han
DOI: https://doi.org/10.48550/arXiv.1806.07691
2018-06-20
Abstract: In this paper, we propose an algorithm that searches for both positive and negative itemsets of interest, which must be available at the first stage of positive and negative association rule mining. Traditional association rule mining algorithms extract positive association rules from frequent itemsets, so they search only for frequent itemsets, i.e. only for positive itemsets of interest. Further, some of the itemsets that traditional algorithms prune to reduce the search space are useful for mining negative association rules. Therefore, the traditional algorithms fail to find the negative itemsets needed for mining negative association rules. Our new algorithm searches for both positive and negative itemsets of interest and thus prepares the preconditions for mining all positive and negative association rules.
Databases
What problem does this paper attempt to address?
This paper addresses the inability of traditional association rule mining algorithms to discover negative association rules effectively. Traditional algorithms focus on extracting positive association rules from frequent itemsets and ignore potentially useful information in infrequent itemsets, so they cannot find the negative itemsets that negative association rules require. To overcome this limitation, the paper proposes a new algorithm that searches for positive and negative itemsets of interest at the same time. This provides the prerequisites for mining both positive and negative association rules and allows a more comprehensive analysis of the associations in the data. Specifically, the paper defines positive and negative itemsets of interest and proposes an algorithmic framework that identifies these itemsets during mining. This helps reduce the search space and improve efficiency while ensuring that important negative association rules are not missed.
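To make the distinction concrete, here is a minimal Python sketch of why pruning by support alone loses candidates for negative rules. It does not reproduce the paper's algorithm or its definition of "interest", which this summary does not spell out. It assumes a simplified criterion: a pair of items is treated as positive if it meets a minimum support threshold, and as a negative candidate if it falls below the threshold although both items are frequent on their own. The names `support`, `split_pairs`, and `min_support`, as well as the toy transactions, are hypothetical.

```python
from itertools import combinations


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    items = set(itemset)
    return sum(items <= t for t in transactions) / len(transactions)


def split_pairs(transactions, min_support):
    """Classify 2-itemsets built from frequent single items.

    Returns (positive, negative): pairs that are frequent, and pairs that
    are infrequent although each item is frequent individually. The second
    group is what a support-only algorithm would prune (assumed criterion,
    not the paper's definition).
    """
    all_items = sorted(set().union(*transactions))
    frequent_items = [
        item for item in all_items
        if support([item], transactions) >= min_support
    ]
    positive, negative = [], []
    for pair in combinations(frequent_items, 2):
        if support(pair, transactions) >= min_support:
            positive.append(pair)
        else:
            negative.append(pair)
    return positive, negative


# Toy data, invented for illustration only.
transactions = [
    {"tea", "milk"},
    {"tea", "milk"},
    {"tea", "sugar"},
    {"coffee", "sugar"},
    {"coffee", "sugar"},
    {"coffee", "milk"},
]

positive, negative = split_pairs(transactions, min_support=0.3)
print("positive:", positive)
print("negative candidates:", negative)
```

In this toy data, tea and coffee each appear in half of the transactions but never together, so the pair ends up among the negative candidates. A support-only search would discard it, even though it is the kind of itemset from which a rule such as "tea implies no coffee" could be derived.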