The Value of Privacy: Strategic Data Subjects, Incentive Mechanisms and Fundamental Limits

Weina Wang,Lei Ying,Junshan Zhang
DOI: https://doi.org/10.48550/arXiv.1603.06870
2016-03-23
Abstract:We study the value of data privacy in a game-theoretic model of trading private data, where a data collector purchases private data from strategic data subjects (individuals) through an incentive mechanism. The private data of each individual represents her knowledge about an underlying state, which is the information that the data collector desires to learn. Different from most of the existing work on privacy-aware surveys, our model does not assume the data collector to be trustworthy. Then, an individual takes full control of its own data privacy and reports only a privacy-preserving version of her data. In this paper, the value of $\epsilon$ units of privacy is measured by the minimum payment of all nonnegative payment mechanisms, under which an individual's best response at a Nash equilibrium is to report the data with a privacy level of $\epsilon$. The higher $\epsilon$ is, the less private the reported data is. We derive lower and upper bounds on the value of privacy which are asymptotically tight as the number of data subjects becomes large. Specifically, the lower bound assures that it is impossible to use less amount of payment to buy $\epsilon$ units of privacy, and the upper bound is given by an achievable payment mechanism that we designed. Based on these fundamental limits, we further derive lower and upper bounds on the minimum total payment for the data collector to achieve a given learning accuracy target, and show that the total payment of the designed mechanism is at most one individual's payment away from the minimum.
Computer Science and Game Theory,Cryptography and Security
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to study the value of data privacy in a game - theory model, especially in the process of trading private data, where data collectors purchase private data from strategic data subjects (individuals) through incentive mechanisms. Each individual's private data represents her knowledge of a certain latent state, and this state is exactly the information that the data collector wants to know. Unlike most existing works, this model does not assume that the data collector is trustworthy. Therefore, individuals have complete control over their own data privacy and only report the privacy - protected version of the data. Specifically, the paper defines a measure of the value of privacy per unit \(\epsilon\), that is, under all non - negative payment mechanisms, the minimum payment required to make an individual's optimal response in Nash equilibrium be to report data with a privacy level of \(\epsilon\). The higher \(\epsilon\) is, the less private the reported data is. The paper derives asymptotically tight upper and lower bounds on the value of privacy as the number of data subjects increases. In particular, the lower bound guarantees that it is impossible to use less payment to buy \(\epsilon\) units of privacy, and the upper bound is given by an achievable payment mechanism designed. Based on these fundamental limitations, the paper further derives upper and lower bounds on the minimum total payment required for data collectors to achieve a given learning accuracy goal, and shows that the total payment of the designed mechanism is at most one individual's payment more than the minimum. In addition, the paper also explores the strategic characteristics of individuals in Nash equilibrium and finds that an individual's strategy is either a symmetric random response or a non - informative strategy, which helps to gain a deep understanding of the behavior of privacy - conscious individuals. Through these analyses, the paper provides a theoretical basis for understanding and quantifying the value of data privacy in the market.