Privacy Preserving Naive Bayes Classification

Peng Zhang,Yunhai Tong,Shiwei Tang,Dongqing Yang
DOI: https://doi.org/10.1007/11527503_88
2007-01-01
Chinese Journal of Computers
Abstract:Privacy preserving data mining is to discover accurate patterns without precise access to the original data. In this paper, we combine the two strategies of data transform and data hiding to propose a new randomization method, Randomized Response with Partial Hiding (RRPH), for distorting the original data. Then, an effective naive Bayes classifier is presented to predict the class labels for unknown samples according to the distorted data by RRPH. Shown in the analytical and experimental results, our method can obtain significant improvements in terms of privacy, accuracy, and applicability.
What problem does this paper attempt to address?