A Fast Algorithm for Hiding High Utility Sequential Patterns

Chunkai Zhang,Yiwen Zu,Junli Nie,Linzi Du,Jingqi Du,Siyuan Hong,Wenping Wu
DOI: https://doi.org/10.1109/ispa-bdcloud-sustaincom-socialcom48970.2019.00189
2019-01-01
Abstract:High utility sequential patterns (HUSPs) are common patterns that can be discovered from the data collected in many domains (e.g. retail, bioinformatics, mobile commerce). To extract these patterns, high utility sequential pattern mining (HUSPM) has been proposed in recent decade. Although the HUSPM algorithms provide us a special perspective to analyze the knowledge behind the collected data, it also arises the risk of the privacy leakage and underlying security issues. This leads to the emergence of high utility sequential pattern hiding (HUSPH) whose purpose is to hide all HUSPs in the sequence database under a specified threshold. Around this topic, many algorithms were proposed. However, the existing algorithms are very time-consuming, which makes them unable to process the real massive data quickly. In this paper, we propose an efficient algorithm named FH-HUSP (fast algorithm for hiding high utility sequential patterns) for HUSPH. Substantial experimental results show that the proposed algorithm can hide all high utility sequential patterns quickly under the specific minimum utility with relatively small modifications.
What problem does this paper attempt to address?