Hiding Sensitive Sequential Patterns by Computing Impact Weight

HUA Bei, ZHONG Cheng, HUANG Zhao-ming, YANG Liu
2010-01-01
Abstract: Privacy-preserving data mining has been one of the hot research topics in recent years. This paper presents an algorithm that hides sensitive sequential patterns by data sanitization. By computing the impact weight of the transactions, it sanitizes the transactions of those sequences that have the minimum impact on the non-sensitive patterns, so that the sensitive sequential patterns are hidden while the impact on the non-sensitive pattern set is minimized. Experimental results on data sets of different densities and sizes show that the presented algorithm protects the sensitive patterns with a lower rate of mistakenly hidden patterns, and that the difference between the original and the sanitized sequence database does not change significantly as the data set size changes.
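The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, whose details the abstract does not give. It assumes that each sequence is a flat list of items (rather than a list of transactions), that the impact weight of a sequence is the number of non-sensitive patterns it supports, and that sanitizing a sequence means deleting every occurrence of the first item of the sensitive pattern. All function names and the `max_support` parameter are hypothetical.

```python
def is_subsequence(pattern, sequence):
    """Return True if pattern occurs in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def support(pattern, database):
    """Count the sequences in the database that contain the pattern."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def impact_weight(sequence, non_sensitive):
    """Assumed weight: number of non-sensitive patterns the sequence supports."""
    return sum(is_subsequence(p, sequence) for p in non_sensitive)


def hide_pattern(database, sensitive, non_sensitive, max_support):
    """Return a sanitized copy in which `sensitive` has support <= max_support.

    Sequences with the lowest impact weight are sanitized first, so that
    non-sensitive patterns lose as little support as possible.
    """
    db = [list(seq) for seq in database]  # work on a copy
    supporters = [i for i, seq in enumerate(db)
                  if is_subsequence(sensitive, seq)]
    supporters.sort(key=lambda i: impact_weight(db[i], non_sensitive))
    excess = max(len(supporters) - max_support, 0)
    for i in supporters[:excess]:
        # Assumed sanitization step: drop all occurrences of one pattern item,
        # which guarantees the sequence no longer contains the pattern.
        victim = sensitive[0]
        db[i] = [item for item in db[i] if item != victim]
    return db
```

For example, with the sequences `abc`, `abd`, `acb`, and `ab`, the sensitive pattern `a, b`, and the non-sensitive patterns `a, c` and `b, d`, lowering the sensitive support from 4 to 3 sanitizes only `ab`, the one sequence that supports no non-sensitive pattern, so the non-sensitive supports are unchanged.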