Abstract: High utility itemsets are sets of items that have a high utility (e.g., a high profit or a high importance) in a transaction database. Discovering high utility itemsets has many important real-life applications, such as market basket analysis. Nonetheless, mining these patterns is time-consuming because of the huge search space and the high cost of utility computation. Most previous work is devoted to search space pruning and pays little attention to utility computation. In fact, both search space pruning and high utility itemset identification rely on the computation of various utilities. This paper proposes a novel algorithm named REX (Rapid itEmset eXtraction), which extends the classic d<math>2</math>HUP algorithm with an improved structure, a <math>k</math>-item utility machine, and an efficient switch strategy. The structure significantly reduces the time complexity of utility computation compared with the original structure used in d<math>2</math>HUP. The machine quickly merges identical transactions and applies an efficient procedure for computing the utilities of the extensions of a given itemset. The strategy, derived from trial and error, yields drastic performance improvements on some databases and remains competitive with the switch strategy used in d<math>2</math>HUP on the others. Experimental results show that REX achieves a speedup ranging from fifty percent to three orders of magnitude over d<math>2</math>HUP, even though the two algorithms use identical pruning techniques, and that REX considerably outperforms state-of-the-art algorithms on real-life and synthetic databases.
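To make the notion of itemset utility concrete, the following minimal Python sketch computes utilities by brute force on a toy database. It is not REX or d2HUP. It assumes the common formulation in which the utility of an itemset in a transaction is the sum of quantity times unit profit over its items. The data, the names `transactions`, `unit_profit`, `itemset_utility`, and `brute_force_huis`, and the threshold are all hypothetical and chosen for illustration only.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"a": 1, "b": 1, "c": 3},
]
# Hypothetical unit profit of each item.
unit_profit = {"a": 5, "b": 2, "c": 1}


def itemset_utility(itemset, transactions, unit_profit):
    """Sum quantity * unit profit over every transaction containing the whole itemset."""
    total = 0
    for t in transactions:
        if all(item in t for item in itemset):
            total += sum(t[item] * unit_profit[item] for item in itemset)
    return total


def brute_force_huis(transactions, unit_profit, min_util):
    """Enumerate every non-empty itemset and keep those with utility >= min_util.

    The loop visits 2^n - 1 itemsets and rescans the database for each one,
    which illustrates both the search space and the utility computation cost.
    """
    items = sorted(unit_profit)
    result = {}
    for k in range(1, len(items) + 1):
        for itemset in combinations(items, k):
            u = itemset_utility(itemset, transactions, unit_profit)
            if u >= min_util:
                result[itemset] = u
    return result


if __name__ == "__main__":
    print(brute_force_huis(transactions, unit_profit, min_util=15))
```

On this toy data with a threshold of 15, the itemset {b} falls below the threshold while its superset {a, b} reaches it. Utility is therefore not anti-monotone, which is one reason the search space cannot be pruned as simply as in frequent itemset mining.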
