Frequent sequence pattern mining with differential privacy

Yanhui LI,Hao LIU,Ye YUAN,Guoren WANG
DOI: https://doi.org/10.11772/j.issn.1001-9081.2017.02.0316
2017-01-01
Abstract:Focusing on the issue that releasing frequent sequence patterns and the corresponding true supports may reveal the individuals' privacy when the data set contains sensitive information,a Differential Private Frequent Sequence Mining (DPFSM) algorithm was proposed.Downward closure property was used to generate a candidate set of sequence patterns,smart truncating based technique was used to sample frequent patterns in the candidate set,and geometric mechanism was utilized to perturb the true supports of each sampled pattern.In addition,to improve the usability of the results,a threshold modification method was proposed to reduce truncation error and propagation error in mining process.The theoretical analysis show that the proposed method is ε-differentially private.The experimental results demonstrate that the proposed method has lower False Negative Rate (FNR) and Relative Support Error (RSE) than that of the comparison algorithm named PFS2,thus effectively improving the accuracy of mining results.
What problem does this paper attempt to address?