Abstract:Due to the inherent flexibilities in both structure and semantics, XML association rules mining faces few challenges, such as: a more complicated hierarchical data structure and ordered data context. Mining frequent patterns from XML documents can be recast as mining frequent tree structures from a database of XML documents. In this study, we model a database of XML documents as a database of rooted labeled ordered subtrees. In particular, we are mainly concerned with mining frequent induced and embedded ordered subtrees. Our main contributions are as follows. We describe our unique embedding list representation of the tree structure, which enables efficient implementation of our Tree Model Guided (TMG) candidate generation. TMG is an optimal, nonredundant enumeration strategy that enumerates all the valid candidates that conform to the structural aspects of the data. We show through a mathematical model and experiments that TMG has better complexity compared to the commonly used join approach. In this article, we propose two algorithms, MB3-Miner and iMB3-Miner. MB3-Miner mines embedded subtrees. iMB3-Miner mines induced and/or embedded subtrees by using the maximum level of embedding constraint. Our experiments with both synthetic and real datasets against two well-known algorithms for mining induced and embedded subtrees, demonstrate the effectiveness and the efficiency of the proposed techniques.

Efficiently Methods For Embedded Frequent Subtree Mining On Biological Data

An Efficient Way of Frequent Embedded Subtree Mining on Biological Data.

Constrained Frequent Subtree Mining Method

ESPM - An algorithm to mine frequent subtrees

Mining Frequent Rooted Subtrees in XML Data with Me-Tree

Mining Frequent Subtrees from Databases of Labeled Rooted Ordered Trees

An Efficient And Fast Algorithm For Mining Frequent Patterns On Multiple Biosequences

Algorithm Considering Imbalance Across Datasets for Mining Frequent Subgraphs

Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining

PFTM: A Frequent Subtrees Mining Algorithm Based on Projection

A Mining Algorithm for Frequent Patterns Based on Prefix Tree

Bottom-up Discovery of Frequent Rooted Unordered Subtrees

Mining Frequent Patterns with the Pattern Tree.

IMB3-Miner: Mining Induced/embedded Subtrees by Constraining the Level of Embedding

A Novel Efficient Mining Algorithm for Frequent Patterns on Biological Multiple Sequence

DOM-Based Algorithm of Mining Frequent Patterns from XML Data

Mining Maximal Frequent Subtrees Based on Fusion Compression and FP-tree

Mining Frequent Rooted Ordered Tree Generators Efficiently

Chopper: Efficient Algorithm for Tree Mining

A Compact FP-Tree and Array-Technique Based Algorithm for Frequent Patterns Mining

Tree model guided candidate generation for mining frequent subtrees from XML documents