Abstract: Mining frequent itemsets is a basic and essential task in many data mining applications, such as association rule mining and long pattern discovery. Many classic algorithms have been introduced to find the frequent itemsets in a database, such as Apriori and FP-Tree. Maximal frequent itemset generation plays an important role in frequent itemset mining, because every frequent itemset is a subset of some maximal frequent itemset. Researchers have focused on developing efficient algorithms along three lines: reducing the number of candidates, reducing the number of database scans, and combining top-down and bottom-up search. Graph-based association rule mining is an effective way to find the maximal frequent itemsets while reducing both the number of candidates and the number of database scans. The approach maps the data in the database to bit vectors and builds the complete itemset information in a single database scan. The support of an itemset can then be calculated by logical operations among the bit vectors. Some researchers have concentrated on improving the performance of graph-based frequent itemset generation by exploiting basic properties of the relation graph. The relation graph is constructed from the frequent 2-itemsets: each vertex represents a specific item, and an edge exists between two vertices if the two corresponding items form a frequent 2-itemset. If an itemset is a frequent k-itemset, the vertices representing its items must form a complete subgraph of the relation graph. The maximal frequent itemsets can therefore be found by way of the maximal complete subgraphs of the relation graph.
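The bit-vector support calculation and the relation graph described above can be sketched as follows. This is a minimal illustration, not the article's implementation: the function names (`build_bit_vectors`, `support`, `relation_graph`), the use of Python integers as bit vectors, and the toy transactions are all assumptions made for the example.

```python
from itertools import combinations


def build_bit_vectors(transactions):
    """One scan of the data: bit t of an item's vector is set
    when transaction t contains that item."""
    vectors = {}
    for tid, transaction in enumerate(transactions):
        for item in transaction:
            vectors[item] = vectors.get(item, 0) | (1 << tid)
    return vectors


def support(itemset, vectors):
    """Support of a non-empty itemset: AND the item vectors, count set bits."""
    items = list(itemset)
    acc = vectors[items[0]]
    for item in items[1:]:
        acc &= vectors[item]
    return bin(acc).count("1")


def relation_graph(vectors, min_support):
    """Adjacency sets over frequent items; an edge joins two items
    whose pair is a frequent 2-itemset."""
    frequent = sorted(i for i in vectors if support([i], vectors) >= min_support)
    graph = {i: set() for i in frequent}
    for a, b in combinations(frequent, 2):
        if support([a, b], vectors) >= min_support:
            graph[a].add(b)
            graph[b].add(a)
    return graph


# Hypothetical toy database of five transactions.
transactions = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a", "b", "c"},
]
vectors = build_bit_vectors(transactions)
graph = relation_graph(vectors, min_support=3)
```

In this toy data every pair has support 3, so the relation graph is a triangle, yet `{a, b, c}` has support 2. A complete subgraph is thus only a candidate: its support still has to be checked against the bit vectors.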
To reduce the number of candidates when forming candidate (k+1)-itemsets from the frequent k-itemsets, the next vertex in the ordering is appended to the tail of a frequent k-itemset only if the new vertex has an edge to each of the k items in that itemset. A coding method for items has also been proposed, in which an item with a larger degree receives a smaller ordering code. In addition, some approaches change the undirected graph into a directed graph. Pincer-Search, which combines bottom-up and top-down search, is a search strategy for pruning the search space. The infrequent itemsets generated bottom-up can be used to split the candidate maximal frequent itemsets generated top-down, and the frequent itemsets generated top-down can reduce the number of frequent itemsets that must be generated bottom-up. This article introduces for the first time the idea of combining graph-based association rule mining with Pincer-Search to generate maximal frequent itemsets, and presents an algorithm based on this idea. Splitting the top-down candidate itemsets with the infrequent 2-itemsets generated bottom-up is the most expensive step, because deriving all maximal complete subgraphs from the infrequent 2-itemsets is an NP-complete problem. The generation of all candidates is therefore postponed, to avoid spending a large amount of time on candidate maximal frequent itemsets that may turn out not to be maximal frequent itemsets. Finally, we compare the new algorithm with the original graph-based association rule mining algorithm.
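The degree-based coding and the tail-extension rule described above can be sketched as follows. This is an illustration under stated assumptions rather than the algorithm presented in the article: the names `order_by_degree` and `extend_candidates`, the tie-breaking by item name, and the small example graph are hypothetical.

```python
def order_by_degree(graph):
    """Give smaller codes to vertices with larger degree
    (ties broken by item name, an assumption of this sketch)."""
    ranked = sorted(graph, key=lambda v: (-len(graph[v]), v))
    return {v: code for code, v in enumerate(ranked)}


def extend_candidates(itemset, graph, code):
    """Yield (k+1)-candidates from a k-itemset given as a tuple in code order.

    A vertex is appended only if it comes later in the ordering than the
    last item and is adjacent to every item already in the itemset.
    """
    last = code[itemset[-1]]
    common = set.intersection(*(graph[v] for v in itemset))
    for v in sorted(common, key=code.get):
        if code[v] > last:
            yield itemset + (v,)


# Hypothetical relation graph as adjacency sets.
graph = {
    "a": {"b", "c", "d"},
    "b": {"a", "c"},
    "c": {"a", "b"},
    "d": {"a"},
}
code = order_by_degree(graph)
from_a = list(extend_candidates(("a",), graph, code))
from_ab = list(extend_candidates(("a", "b"), graph, code))
from_ad = list(extend_candidates(("a", "d"), graph, code))
```

Here `("a", "d")` yields no extension because no other vertex is adjacent to both items, so that branch is pruned without any support computation. Each candidate that is produced still needs its support verified.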

Non-Almost-Derivable Frequent Itemsets Mining

Mining Noise-Tolerant Frequent Closed Itemsets in Very Large Database.

A New Algorithm for Mining Global Frequent Itemsets in a Stream.

Mining Associated and Item-Item Correlated Frequent Patterns

Gc-Tree: A Fast Online Algorithm For Mining Frequent Closed Itemsets

Mining Maximum Length Frequent Itemsets: A Summary of Results

Mining Approximate Frequent Itemsets from Noisy Data

Dtgc-Tree: A New Strategy Of Association Rules Mining

A New Concise Representation Method of Generalized Frequent Itemsets

Summary queries for frequent itemsets mining

A Concise Representation of Generalized Frequent Itemsets Based on Profile Summary

An Algorithm of Mining Frequent Itemsets Based on Bloom Filter

Mining Algorithm of Frequent Closed Itemsets Based on Uncertain Data

MapReduce-based Parallelized Approximation of Frequent Itemsets Mining in Uncertain Data.

Mining Approximate Frequent Itemsets in the Presence of Noise: Algorithm and Analysis

A STABLE PARALLEL DISTRIBUTED FREQUENT ITEMSET MINING ALGORITHM AND ITS APPLICATION

Mining frequent itemset from uncertain data

Maximal frequent itemset generation based on graph

CG-FHAUI: an efficient algorithm for simultaneously mining succinct pattern sets of frequent high average utility itemsets

Frequent itemsets compressing based on minimum cover: An efficient method for mining medication law of Chinese herbs

Frequent Item-set Mining without Ubiquitous Items