Abstract:Mining frequent itemsets is a basic and essential task in many data mining applications such as association rules mining and long patterns discovery.Many classic algorithms have been introduced to find the frequent itemsets in database, such as aprior and FP-Tree.Maximal frequent itemsets generation plays an important role in the frequent itemsets mining,because all the frequent itemsets are the subset of the maximal frequent itemsets. Researchers focused on developing efficient algorithm to find frequent itemsets on the following three categories:reducing the number of candidate number, database scan and combining top-down and bottom-up search.Graph-based association rules mining is an excellent method to find the maximal frequent itemsets so as to reduce the number of candidate and the number of database scan.The paradigm maps the data in database to bit vector and construct the entire itemsets information by one database scan.The support of itemsets can be calculated by the logic opreration among bit vetors.Some researchers concentrated on the uplife the performance in graph-based frequent itemsets generation by the basic property of relation graph.Relation graph is constructed by the 2-frequent itemsets in which the vertex presents the specific item, and the edge exsits between two vertexs if the two specific corresponding items are the 2-frequent itemset. Once one itemset is k-frequent itemsets,the subgraph of the vertexs presenting the items in the itemset must be the maximal complete subgraph of the relation graph.That is the way to find the maximal frequent itemsets by using the maximal complete subgraph in the relation graph. To reduce the number of the candidate in the context of forming the k+1-frequent candidate itemsets from the k-frequent itemsets,the next ordering vertex was added to the tail of the k-frequent itemsets on the condition that the new add vertex must have edge with the k items in k-frequent itemsets.The coding method of items was also proposed, in which the item with bigger degee has the smaller ordering code. Besides,some change the undirected graph to directed graph.Bottom-up and top-down search named by Pincer-Search is a search stradgy to cut off the search space.The bottom-up generated non-frequent itemsets can be used to split the top-down maximal frequent itemsets generation,and the top-down generated frequent itmesets can reduce the number of the bottom-up frequent itemsets.The idea of combining the association rule mining based on graph with Pincer-Search to generate maximal frequent itemsets is first introduced in the article, and the algorithm based on the idea is also presented. The bottom up generated 2-non frequent itmesets splits the top-down frequent itemsets is the most costing task,because the problem that the all maximal complete subgrah is got by the 2-non frequent itemsets is NPC problem.The time of generating all candidates is postponed to avoid costing lots of time to generate the candidate maximal frequent candidate itemsets which may not be the real maximal frequent itemsets. Finally, we compare the new algorithm with primitive graph-based association rules mining.

SPIN: mining maximal frequent subgraphs from graph databases.

JPMiner: Mining Frequent Jump Patterns from Graph Databases.

Efficient Algorithms for Summarizing Graph Patterns

Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism

Extracting Frequent Connected Subgraphs from Large Graph Sets.

GraphMiner

Extract Frequent Pattern from Simple Graph Data.

Mining Discriminative Subgraph Patterns from Structural Data

Scalable Mining of Large Disk-Based Graph Databases.

Bottom-up Discovery of Frequent Rooted Unordered Subtrees

Mining Frequent Neighborhood Patterns in Large Labeled Graphs

Maximal frequent itemset feneration based on graph

Representation Learning for Frequent Subgraph Mining

Efficient Mining Of Minimal Distinguishing Subgraph Patterns From Graph Databases

Efficient Mining of Frequent Subgraphs with Two-Vertex Exploration

Visual Graph Mining

GCG: Mining Maximal Complete Graph Patterns from Large Spatial Data

Efficient Algorithms for Densest Subgraph Discovery

Graph-based substructure pattern mining with edge-weight

Diversified Temporal Subgraph Pattern Mining

Efficiently extracting frequent subgraphs using MapReduce