Abstract:There exists a growing concern about the privacy threats inherently encoded in the increasingly available behavioral data. In this paper, we focus on online service providers over which users reveal preferences (e.g., ratings or reviews) as publicly available behavioral data. We model users' strategic interactions in making these decisions as quadratic games on networks, where a user's payoff depends not only on their actions but also on their neighbors in a social interaction network. Based on the assumption that the observed preferences correspond to the Nash equilibrium of the game, we investigate whether eavesdroppers or competitors having access to such behavioral data could potentially infer or even thoroughly learn the hidden social interaction network from the observed user actions, leading to privacy risks at a system level for the platform. We introduce three notions of privacy risks on networks: fundamental privacy, conditional privacy, and practical privacy. We establish necessary and sufficient conditions for fundamental and conditional privacy of the social interaction structure, suggesting that at the system level, such interaction structure is not facing a critical risk of being fully exposed regardless of the amount of data available. To study practical privacy, we further propose and analyze a novel learning framework by solving a joint optimization problem for the network structure and the individual marginal benefits in the game. Extensive experiments on synthetic and real-world data demonstrate the effectiveness of the proposed framework, hence illustrating that behavioral data indeed present substantial privacy risks in practice. Broadly, this study enriches the network privacy literature and provides implications for online platforms by revealing the unexpected consequence of leaking network connections due to public consumer revealed preferences.

Privacy-Accuracy Trade-Off in Distributed Data Mining

Privacy-Accuracy Trade-Off in Differentially-Private Distributed Classification: A Game Theoretical Approach

Privacy Preserving Distributed Classification: A Satisfaction Equilibrium Approach

Privacy Preserving Distributed DBSCAN Clustering

Privacy Risks of Social Interaction Structure: Network Learning in Quadratic Games

PEM: A Practical Differentially Private System for Large-Scale Cross-Institutional Data Mining.

Privacy-Preserving Distributed Optimization and Learning

Differentially Private Nash Equilibrium Seeking in Quadratic Network Games

Information Security in Big Data: Privacy and Data Mining

Two-Party Privacy Games: How Users Perturb When Learners Preempt

Privacy-preserving algorithms for distributed mining of frequent itemsets

A Privacy Protection Model of Data Publication Based on Game Theory

Quantum Privacy-Preserving Data Mining

The Conflict Between Big Data and Individual Privacy

Differential Privacy-preserving Distributed Machine Learning

A Privacy-Preserving Classification Mining Algorithm

Privacy-Preserving Classification of Customer Data Without Loss of Accuracy

Utility–Privacy Trade-Off in Distributed Machine Learning Systems

Not one but many Tradeoffs: Privacy Vs. Utility in Differentially Private Machine Learning

Privacy-Preserving Data Collecting: A Simple Game Theoretic Approach