Abstract: We study the value of data privacy in a game-theoretic model of trading private data, where a data collector purchases private data from strategic data subjects (individuals) through an incentive mechanism. The private data of each individual represents her knowledge about an underlying state, which is the information that the data collector desires to learn. Different from most of the existing work on privacy-aware surveys, our model does not assume the data collector to be trustworthy. Then, an individual takes full control of her own data privacy and reports only a privacy-preserving version of her data. In this paper, the value of $\epsilon$ units of privacy is measured by the minimum payment of all nonnegative payment mechanisms, under which an individual's best response at a Nash equilibrium is to report the data with a privacy level of $\epsilon$. The higher $\epsilon$ is, the less private the reported data is. We derive lower and upper bounds on the value of privacy which are asymptotically tight as the number of data subjects becomes large. Specifically, the lower bound assures that it is impossible to use a smaller payment to buy $\epsilon$ units of privacy, and the upper bound is given by an achievable payment mechanism that we designed. Based on these fundamental limits, we further derive lower and upper bounds on the minimum total payment for the data collector to achieve a given learning accuracy target, and show that the total payment of the designed mechanism is at most one individual's payment away from the minimum.

What problem does this paper attempt to address?

This paper studies the value of data privacy in a game-theoretic model of trading private data, in which a data collector purchases private data from strategic data subjects (individuals) through an incentive mechanism. Each individual's private data represents her knowledge about an underlying state, which is the information the data collector wants to learn. Unlike most existing work on privacy-aware surveys, the model does not assume that the data collector is trustworthy. Each individual therefore has full control over her own data privacy and reports only a privacy-preserving version of her data. The paper measures the value of $\epsilon$ units of privacy as the minimum payment, over all nonnegative payment mechanisms, under which an individual's best response at a Nash equilibrium is to report data with a privacy level of $\epsilon$. The higher $\epsilon$ is, the less private the reported data is. The paper derives lower and upper bounds on the value of privacy that are asymptotically tight as the number of data subjects grows large. The lower bound shows that no mechanism can buy $\epsilon$ units of privacy for a smaller payment, and the upper bound is attained by a payment mechanism that the authors design. Building on these fundamental limits, the paper also derives lower and upper bounds on the minimum total payment the data collector needs to reach a given learning accuracy target, and shows that the total payment of the designed mechanism is at most one individual's payment above the minimum. In addition, the paper characterizes individuals' strategies at a Nash equilibrium: each individual's strategy is either a symmetric randomized response or a non-informative strategy, which clarifies how privacy-aware individuals behave.
Through these analyses, the paper provides a theoretical basis for understanding and quantifying the value of data privacy in the market.
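
To make the notion of a privacy level $\epsilon$ concrete, here is a minimal sketch of symmetric randomized response in its standard textbook form. It assumes each individual's data is a single bit, which the summary above does not state. The function names are hypothetical, and the code illustrates only the reporting step, not the paper's payment mechanism.

```python
import math
import random


def keep_probability(epsilon: float) -> float:
    """Probability of reporting the true bit at privacy level epsilon >= 0.

    Standard symmetric randomized response: e^eps / (1 + e^eps),
    written as 1 / (1 + e^-eps) to avoid overflow for large epsilon.
    """
    return 1.0 / (1.0 + math.exp(-epsilon))


def randomized_response(bit: int, epsilon: float, rng: random.Random) -> int:
    """Report `bit` truthfully with keep_probability(epsilon), else flip it."""
    if rng.random() < keep_probability(epsilon):
        return bit
    return 1 - bit


def privacy_level(p_keep: float) -> float:
    """Inverse map: the epsilon implied by a keep probability in [0.5, 1)."""
    return math.log(p_keep / (1.0 - p_keep))


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same reports
    for eps in (0.0, 0.5, 1.0, 3.0):
        reports = [randomized_response(1, eps, rng) for _ in range(10_000)]
        share_true = sum(reports) / len(reports)
        print(f"epsilon={eps}: keep prob={keep_probability(eps):.3f}, "
              f"empirical share of truthful reports={share_true:.3f}")
```

At $\epsilon = 0$ the report is a fair coin flip and carries no information about the bit. As $\epsilon$ grows, the report matches the true bit more often, which is the sense in which a higher $\epsilon$ means less privacy.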

The Value of Privacy: Strategic Data Subjects, Incentive Mechanisms and Fundamental Limits

Privacy Risks of Social Interaction Structure: Network Learning in Quadratic Games

Game Theoretic Data Privacy Preservation: Equilibrium And Pricing

A Theory of Pricing Private Data

Privacy-Preserving Data Collecting: A Simple Game Theoretic Approach

Privacy or Utility in Data Collection? A Contract Theoretic Approach

Privacy Can Arise Endogenously in an Economic System with Learning Agents

A Privacy Protection Model of Data Publication Based on Game Theory

On the Differential Private Data Market: Endogenous Evolution, Dynamic Pricing, and Incentive Compatibility

Dynamic Privacy Pricing: A Multi-Armed Bandit Approach with Time-Variant Rewards

An Incentive Mechanism for Trading Personal Data in Data Markets

Unpacking privacy: Valuation of personal data protection

The Privacy Paradox and Optimal Bias–Variance Trade-offs in Data Acquisition

The Strange Case of Privacy in Equilibrium Models

Pricing and disseminating customer data with privacy awareness

Buying private data without verification

Contract-Based Private Data Collecting

A Reasonable Data Pricing Mechanism for Personal Data Transactions with Privacy Concern

Optimal Data Acquisition with Privacy-Aware Agents

Approximately Optimal Auctions for Selling Privacy when Costs are Correlated with Data

How to Balance Privacy and Money through Pricing Mechanism in Personal Data Market