Mining insightful classification rules directly and efficiently

Hongyan Liu,Jian Chen,Guoqing Chen
DOI: https://doi.org/10.1109/ICSMC.1999.823349
1999-01-01
Abstract:Classification is one of the important problems in the field of data mining. Many algorithms have been proposed to solve this problem and each has its own drawback. This paper discusses issues about mining classification rules directly and proposes two algorithms, namely UARC and GARC. These algorithms use a more suitable association rule mining technique to find insightful and a complete set of rules directly and accurately. Unlike most other association rule mining algorithms, the algorithms proposed in the paper can find both frequent k-itemset and rules at the same step. After each scan of the database, only rule itemsets and excluded itemsets are saved and used to exclude much more itemsets to generate larger candidate itemsets, which will save much computation time and memory. Using the information gain criterion, many training cases which satisfy a special condition can be deleted from database, which will lead to fewer I/O times for every remaining scan of a database. Finally, a criterion is defined to terminate the whole mining process much earlier and at the same time produce a meaningful rule
What problem does this paper attempt to address?