Abstract:Mining frequent itemsets is a basic and essential task in many data mining applications such as association rules mining and long patterns discovery.Many classic algorithms have been introduced to find the frequent itemsets in database, such as aprior and FP-Tree.Maximal frequent itemsets generation plays an important role in the frequent itemsets mining,because all the frequent itemsets are the subset of the maximal frequent itemsets. Researchers focused on developing efficient algorithm to find frequent itemsets on the following three categories:reducing the number of candidate number, database scan and combining top-down and bottom-up search.Graph-based association rules mining is an excellent method to find the maximal frequent itemsets so as to reduce the number of candidate and the number of database scan.The paradigm maps the data in database to bit vector and construct the entire itemsets information by one database scan.The support of itemsets can be calculated by the logic opreration among bit vetors.Some researchers concentrated on the uplife the performance in graph-based frequent itemsets generation by the basic property of relation graph.Relation graph is constructed by the 2-frequent itemsets in which the vertex presents the specific item, and the edge exsits between two vertexs if the two specific corresponding items are the 2-frequent itemset. Once one itemset is k-frequent itemsets,the subgraph of the vertexs presenting the items in the itemset must be the maximal complete subgraph of the relation graph.That is the way to find the maximal frequent itemsets by using the maximal complete subgraph in the relation graph. To reduce the number of the candidate in the context of forming the k+1-frequent candidate itemsets from the k-frequent itemsets,the next ordering vertex was added to the tail of the k-frequent itemsets on the condition that the new add vertex must have edge with the k items in k-frequent itemsets.The coding method of items was also proposed, in which the item with bigger degee has the smaller ordering code. Besides,some change the undirected graph to directed graph.Bottom-up and top-down search named by Pincer-Search is a search stradgy to cut off the search space.The bottom-up generated non-frequent itemsets can be used to split the top-down maximal frequent itemsets generation,and the top-down generated frequent itmesets can reduce the number of the bottom-up frequent itemsets.The idea of combining the association rule mining based on graph with Pincer-Search to generate maximal frequent itemsets is first introduced in the article, and the algorithm based on the idea is also presented. The bottom up generated 2-non frequent itmesets splits the top-down frequent itemsets is the most costing task,because the problem that the all maximal complete subgrah is got by the 2-non frequent itemsets is NPC problem.The time of generating all candidates is postponed to avoid costing lots of time to generate the candidate maximal frequent candidate itemsets which may not be the real maximal frequent itemsets. Finally, we compare the new algorithm with primitive graph-based association rules mining.

Maximal Frequent Item Sequences Mining of Datasets with Few Attributes and Large Instances

A New Algorithm for Mining Global Frequent Itemsets in a Stream.

Approximate mining of global closed frequent itemsets over data streams

Mining Noise-Tolerant Frequent Closed Itemsets in Very Large Database.

Gc-Tree: A Fast Online Algorithm For Mining Frequent Closed Itemsets

Finding Frequent Closed Itemsets in Sliding Window in Linear Time.

Efficient algorithms for deriving complete frequent itemsets from frequent closed itemsets

Mining Maximum Length Frequent Itemsets: A Summary of Results

Maximal frequent itemset feneration based on graph

Incremental frequent itemsets mining based on frequent pattern tree and multi-scale

MFS-SubSC: an efficient algorithm for mining frequent sequences with sub-sequence constraint

Discovery of Maximal Frequent Item Sets using Subset Creation

Algorithms for Mining Frequent Itemsets with Multi-Predication Constraints Based on Frequent Pattern Growth

PFIMD: a parallel MapReduce-based algorithm for frequent itemset mining

Frequent Item-set Mining without Ubiquitous Items

FRI-Miner: Fuzzy Rare Itemset Mining

Multiobjective-integer-programming-based Sensitive Frequent Itemsets Hiding.

A STABLE PARALLEL DISTRIBUTED FREQUENT ITEMSET MINING ALGORITHM AND ITS APPLICATION

Discovering Periodic Patterns Common to Multiple Sequences

Probabilistic Support Prediction: Fast Frequent Itemset Mining in Dense Data

TFP: an Efficient Algorithm for Mining Top-K Frequent Closed Itemsets