Abstract:Many data mining tasks rely on graphs to model relational structures among individuals (nodes). Since relational data are often sensitive, there is an urgent need to evaluate the privacy risks in graph data. One famous privacy attack against data analysis models is the model inversion attack, which aims to infer sensitive data in the training dataset and leads to great privacy concerns. Despite its success in grid-like domains, directly applying model inversion attacks on non-grid domains such as graph leads to poor attack performance. This is mainly due to the failure to consider the unique properties of graphs. To bridge this gap, we conduct a systematic study on model inversion attacks against Graph Neural Networks (GNNs), one of the state-of-the-art graph analysis tools in this paper. First, in the white-box setting where the attacker has full access to the target GNN model, we present GraphMI to infer the private training graph data. Specifically in GraphMI, a projected gradient module is proposed to tackle the discreteness of graph edges and preserve the sparsity and smoothness of graph features; a graph auto-encoder module is used to efficiently exploit graph topology, node attributes, and target model parameters for edge inference; a random sampling module can finally sample discrete edges. Furthermore, in the hard-label black-box setting where the attacker can only query the GNN API and receive the classification results, we propose two methods based on gradient estimation and reinforcement learning (RL-GraphMI). With the proposed methods, we study the connection between model inversion risk and edge influence and show that edges with greater influence are more likely to be recovered. Extensive experiments over several public datasets demonstrate the effectiveness of our methods. We also evaluate our attacks under two defenses: one is the well-designed differential private training, and the other is graph preprocessing. Our experimental results show that such defenses are not sufficiently effective and call for more advanced defenses against privacy attacks.

Neural Network Inversion in Adversarial Setting Via Background Knowledge Alignment

Adversarial Neural Network Inversion via Auxiliary Knowledge Alignment

The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks

A GAN-Based Defense Framework Against Model Inversion Attacks.

NetGuard: Protecting Commercial Web APIs from Model Inversion Attacks Using GAN-generated Fake Samples

Network Inversion and Its Applications

Inversion-guided Defense: Detecting Model Stealing Attacks by Output Inverting

Improving Query Efficiency of Black-box Adversarial Attack

The Role of Class Information in Model Inversion Attacks Against Image Deep Learning Classifiers

Reinforcement Learning-Based Black-Box Model Inversion Attacks

Landscape Learning for Neural Network Inversion

Model Inversion Attack against Transfer Learning: Inverting a Model without Accessing It

Boosting Model Inversion Attacks with Adversarial Examples

Variational Model Inversion Attacks

Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses

Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks

Network Inversion of Convolutional Neural Nets

The Enemy of My Enemy is My Friend: Exploring Inverse Adversaries for Improving Adversarial Training

Re-thinking Model Inversion Attacks Against Deep Neural Networks

InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion

Model Inversion Attacks Against Graph Neural Networks