A comparison between local and global recoding algorithms for achieving microdata -sensitive -anonymity.

Traian Marius Truta,A. Campan,Michael Abrinica,John Miller
Abstract:New privacy regulations together with ever-increasing data availability and computational power have created a huge interest in data privacy research. One major research direction is built around k -anonymity property, which is required for the released data. Although k -anonymity protects against identity disclosure, it fails to provide an adequate level of protection with respect to attribute disclosure. We introduced a new privacy protection property called p-sensitive k -anonymity that avoids this shortcoming. We developed new algorithms (GreedyPKClustering and EnhancedPKClustering) and adapted an existing algorithm (Incognito) to generate masked microdata with p-sensitive k -anonymity property. All these algorithms try to reduce the amount of information lost while transforming data to conform to p-sensitive k -anonymity. They are different in the masking methods they use. The new algorithms are based on local recoding masking methods. Incognito, initially designed for k -anonymity, uses global recoding for masking. This paper’s goal is to compare the impact of the masking method on the quality of the masked microdata obtained. For this we compare the quality of the results (cost measures based on data utility) and the efficiency (running time) of these three algorithms for masking both real and synthetic data sets. 2000 Mathematics Subject Classification: 68P15, 68U35.
What problem does this paper attempt to address?