Determining error bounds for spectral filtering based reconstruction methods in privacy preserving data mining

Songtao Guo, Xintao Wu, Yingjiu Li
DOI: https://doi.org/10.1007/s10115-008-0123-9
IF: 2.7
2008-01-01
Knowledge and Information Systems
Abstract: Additive randomization has been a primary tool for hiding sensitive private information. Previous work showed empirically that individual data values can be approximately reconstructed from the perturbed values using spectral filtering techniques, which poses a serious threat of privacy breaches. In this paper we conduct a theoretical study of how the reconstruction error varies for different types of additive noise. In particular, we first derive an upper bound for the reconstruction error using matrix perturbation theory. Attackers who use spectral filtering techniques to estimate the true data values may leverage this bound to determine how close their estimates are to the original data. We then derive a lower bound for the reconstruction error, which can help data owners decide how much noise should be added to satisfy a given threshold of tolerated privacy breach.
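The reconstruction attack described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a common PCA-style form of spectral filtering, in which the perturbed data are projected onto the top-k eigenvectors of their own covariance matrix. The synthetic data, the noise level, the function name `spectral_filter`, and the choice k = 2 (taken as known here) are all assumptions made for illustration.

```python
import numpy as np

def spectral_filter(perturbed, k):
    """Project perturbed data onto its top-k principal directions.

    Hypothetical helper for illustration; k is assumed to be known.
    """
    mean = perturbed.mean(axis=0)
    centered = perturbed - mean
    # Covariance of the perturbed data (features are columns).
    cov = np.cov(centered, rowvar=False)
    # eigh returns eigenvalues in ascending order, so the last k
    # columns of the eigenvector matrix span the dominant subspace.
    _, eigvecs = np.linalg.eigh(cov)
    top = eigvecs[:, -k:]
    # Keep only the component lying in the dominant subspace.
    return mean + centered @ top @ top.T

rng = np.random.default_rng(0)
n_samples, n_features, rank = 1000, 20, 2

# Synthetic "original" data with strong low-rank correlation (assumption).
latent = 5.0 * rng.standard_normal((n_samples, rank))
mixing = rng.standard_normal((rank, n_features))
original = latent @ mixing

# Additive randomization: i.i.d. Gaussian noise with unit variance.
noise = rng.standard_normal((n_samples, n_features))
perturbed = original + noise

estimate = spectral_filter(perturbed, k=rank)

# Frobenius-norm errors before and after filtering.
err_noise = np.linalg.norm(perturbed - original)
err_filtered = np.linalg.norm(estimate - original)
print(f"error of perturbed data: {err_noise:.2f}")
print(f"error after filtering:   {err_filtered:.2f}")
```

On strongly correlated data such as this, the filtered estimate should lie noticeably closer to the original values than the perturbed data do. The error that remains after filtering is the quantity the paper's upper and lower bounds are about.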