Collecting Preference Rankings under Local Differential Privacy
Xiang Cheng, Jianyu Yang, Yufei Wang, Rui Chen, Sen Su, Yuejia Li
DOI: https://doi.org/10.1109/tkde.2022.3186907
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: With the deep penetration of the Internet and mobile devices, preference rankings are being collected on a massive scale by diverse data collectors for various business demands. However, users’ preference rankings in many applications are highly sensitive. Without proper privacy protection mechanisms, it either puts individual privacy in jeopardy or hampers business opportunities due to users’ unwillingness to share their true rankings. In this paper, we initiate the study of collecting preference rankings under local differential privacy. The key technical challenge comes from the fact that the number of possible rankings could be large in practical settings, leading to excessive injected noise. To solve this problem, we present a novel approach SAFARI, whose main idea is to collect a set of distributions over small domains which are carefully chosen based on the riffle independent (RI) model to approximate the overall distribution of users’ rankings, and then generate a synthetic ranking dataset from the obtained distributions. By working on small domains instead of a large domain, SAFARI can significantly reduce the magnitude of added noise. In SAFARI, we design two transformation rules, namely Rule I and Rule II, to instruct users to transform their data to provide the information about the distributions of the small domains. In particular, we propose a method called LADE to precisely estimate the required distributions used for the structure learning of RI model. We also propose a new LDP method called SAFA for frequency estimation over multiple attributes that have small domains. We formally prove that SAFARI guarantees $\varepsilon$-local differential privacy. Extensive experiments on real datasets confirm the effectiveness of SAFARI.
computer science, information systems, artificial intelligence, engineering, electrical & electronic
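
The abstract's point that a large ranking domain leads to excessive injected noise can be illustrated with a minimal sketch. The code below is not SAFARI, LADE, or SAFA; it uses k-ary randomized response, a standard LDP frequency oracle swapped in here purely for illustration. All function names are hypothetical, and the variance expression is the usual approximation for values with low true frequency.

```python
import math
import random
from collections import Counter


def grr_probs(epsilon, k):
    """Return (p, q): probability of reporting the true value, and of
    reporting any one specific other value, over a domain of size k."""
    e = math.exp(epsilon)
    p = e / (e + k - 1)
    q = 1.0 / (e + k - 1)
    return p, q


def grr_perturb(value, k, epsilon, rng):
    """Perturb one user's value in range(k) on the user's side."""
    p, _ = grr_probs(epsilon, k)
    if rng.random() < p:
        return value
    # Pick uniformly among the k - 1 values different from the true one.
    other = rng.randrange(k - 1)
    return other if other < value else other + 1


def grr_estimate(reports, k, epsilon):
    """Unbiased frequency estimates computed by the collector."""
    n = len(reports)
    p, q = grr_probs(epsilon, k)
    counts = Counter(reports)
    return [(counts.get(v, 0) / n - q) / (p - q) for v in range(k)]


def grr_noise_variance(k, epsilon, n):
    """Approximate variance of one frequency estimate (low-frequency value)."""
    p, q = grr_probs(epsilon, k)
    return q * (1 - q) / (n * (p - q) ** 2)


if __name__ == "__main__":
    rng = random.Random(0)
    epsilon, n, items = 1.0, 10_000, 6
    # Noise when each user reports over a small domain of 6 values,
    # versus over all 6! = 720 possible full rankings of 6 items.
    print(grr_noise_variance(items, epsilon, n))
    print(grr_noise_variance(math.factorial(items), epsilon, n))
    # End-to-end run on a small domain with synthetic (made-up) data.
    data = [rng.randrange(items) for _ in range(n)]
    reports = [grr_perturb(v, items, epsilon, rng) for v in data]
    print(grr_estimate(reports, items, epsilon))
```

Under this mechanism the approximate variance works out to (e^ε + k − 2) / (n (e^ε − 1)^2), so it grows linearly in the domain size k. Since the number of full rankings of m items is m!, reporting over the full ranking domain is far noisier than reporting over a small domain, which is the motivation the abstract gives for working with small domains.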