Synthesizing: Art of Anonymization

Jun Gu, Yuexian Chen, Junning Fu, Huanchun Peng, Xiaojun Ye
DOI: https://doi.org/10.1007/978-3-642-15364-8_33
2010-01-01
Abstract: Although a number of anonymization techniques exist for microdata publication, two problems remain: (1) privacy breaches through auxiliary knowledge, and (2) large information loss during anonymization. We establish the requirement of presence anonymity and propose a two-step synthesizing process: first learn a model from the original data, then sample from it a published version that has similar statistical characteristics and includes fake records. The advantage is that this prevents auxiliary-knowledge attacks while still enabling researchers to reach correct or approximately correct conclusions. Furthermore, its effectiveness is demonstrated through extensive experiments.
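The two-step process described in the abstract (learn a model, then sample a published version) can be illustrated with a minimal sketch. The abstract does not say which model the authors use, so this example assumes the simplest possible one: an independent empirical frequency table per attribute. The function names `learn_model` and `sample_records` and the toy records are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def learn_model(records):
    """Step 1: estimate one empirical frequency table per attribute.

    Assumption: attributes are modeled independently. This is an
    illustrative simplification, not the model used in the paper.
    """
    columns = list(zip(*records))
    return [Counter(col) for col in columns]


def sample_records(model, n, seed=None):
    """Step 2: draw n synthetic records from the learned model."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n):
        record = tuple(
            rng.choices(list(counts.keys()), weights=list(counts.values()))[0]
            for counts in model
        )
        synthetic.append(record)
    return synthetic


# Toy microdata: (age range, sex, diagnosis). Values are made up.
original = [
    ("30-39", "F", "flu"),
    ("30-39", "M", "flu"),
    ("40-49", "F", "asthma"),
    ("40-49", "M", "flu"),
]

model = learn_model(original)
published = sample_records(model, n=4, seed=0)
```

Because each attribute is sampled on its own, the published table can contain combinations that never occurred in the original data, which corresponds to the fake records mentioned in the abstract. The same independence assumption also discards correlations between attributes, so a model intended to preserve statistical characteristics would need to capture joint structure as well.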