An Effective Scheme for Top-K Frequent Itemset Mining under Differential Privacy Conditions

Wenjuan Liang,Hong Chen,Jing Zhang,Dan Zhao,Cuiping Li
DOI: https://doi.org/10.1007/s11432-018-9849-y
2020-01-01
Science China Information Sciences
Abstract:Dear editor, Frequent itemset mining (FIM) is important in many data mining applications [1],such as web log mining and trend analysis.However,if the data are sensitive (e.g.,web browsing history),directly releasing frequent itemsets and their support may breach user privacy.The protection of user privacy while obtaining statistical information is important.Differential privacy (DP) is a strong and rigorous standard for privacy protection.In this study,we focused on effectively discovering top-k frequent itemsets under DP conditions.By adding a carefully selected amount of noise,DP ensures that the output of a computation is not sensitive to any individual tuple,and thus,user's privacy can be protected.The amount of noise is determined by the privacy budget e and the sensitivity.
What problem does this paper attempt to address?