SvdNPD: a negative data publication method based on the sensitive value distribution
Linli Wu,Wenjian Luo,Dongdong Zhao
DOI: https://doi.org/10.1109/AISW.2015.7469246
2015-01-01
Abstract:Existing data publication methods retain the relationship between the quasi-identifier attributes and sensitive attributes of published data. We call them "positive data publication". However, it will lead to potential risk that attackers could deduce the privacy of the corresponding individuals from the published data. Recently, by combining the negative representation with k-anonymity and l-diversity, (k, m)- anonNPD and (l, m)-divNPD algorithms have been proposed, which improve the degree of privacy because the sensitive value of each individual is transformed into a negative value randomly. However, to reduce the average error of the reconstructed distribution, the data published by (k, m)-anonNPD and (l, m)-divNPD are often much larger than the original data. In this paper, we take the Sensitive Value Distribution into consideration, and propose a novel Negative Data Publication approach (i.e. SvdNPD), which is based on l-diversity. According to the sensitive value distribution, the negative selection probability of the sensitive attribute in each record can be calculated. Then the data could be published by transforming the sensitive value to a negative one with this probability. Experimental results show that the effective usage of the individual sensitive value distribution could enhance the utility of published data. Meanwhile, compared with (l, 1)-divNPD, SvdNPD just sacrifices a tiny privacy degree which is acceptable.