Analyzing Preference Data with Local Privacy: Optimal Utility and Enhanced Robustness

Shaowei Wang,Xuandi Luo,Yuqiu Qian,Jiachun Du,Wenqing Lin,Wei Yang
DOI: https://doi.org/10.1109/tkde.2022.3207486
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Online service providers benefit from collecting and analyzing preference data from users, including both implicit preference data (e.g., watched videos of a user) and explicit preference data (e.g., ranking data over candidates). However, it brings ethical and legal issues of data privacy at the same time. In this paper, we study the problem of aggregating individual's preference data in the local differential privacy (LDP) setting. One naive approach is to add Laplace random noises, which however suffers from low statistical utility and is fragile to LDP-specific poisoning attacks. Therefore, we propose a novel mechanism to improve the utility and the robustness simultaneously: the additive mechanism. The additive mechanism randomly outputs a subset of candidates with a probability proportional to their total scores. For preference data with Borda rule over d items, its mean squared error bound is optimized from O(d(5)/ne(2)) to O(d(4)/ne(2)),and its maximum poisoning risk bound is reduced from +8 to O(d(2)/ne). We also theoretically investigate minimax lower bounds of e-LDP preference data aggregation, and prove the error rate of O(d(4)/ne(2)) is optimal for the Borda rule. Experimental results validate that our proposed approaches averagely reduce estimation error by 50% and are more robust to adversarial poisoning attacks.
What problem does this paper attempt to address?