Abstract:The deployment of deep learning applications has to address the increasing privacy concerns when using private and sensitive data for training. A conventional deep learning model is prone to privacy attacks that can recover the sensitive information from either model parameters or accesses to the inference model. Recently, differential privacy (DP) has been proposed to offer provable privacy guarantees by randomizing the training process of neural networks. However, many approaches tend to provide the worst case privacy protection for model publishing, inevitably impairing the accuracy of the trained models. Thus, we present a novel private knowledge transfer strategy, where the private teacher trained on sensitive data is not publicly accessible but the student models can be released with privacy guarantees. In this paper, a three-player (teacher-student-discriminator) learning framework, Private Knowledge Distillation with Generative Adversarial Networks (PKDGAN), is proposed, where the student acquires the distilled knowledge from the teacher and is trained with the discriminator to generate similar outputs as the teacher. Moreover, a cooperative learning strategy is also suggested to support the collective training of multiple students against the discriminator when each student is with insufficient unlabelled training data. To enforce rigorous privacy guarantees, PKDGAN applies a Rényi differential privacy mechanism throughout the training process, and use it with a moment accountant technique to track the privacy cost. PKDGAN allows students to be trained with unlabelled public data and very few epochs, which avoids the exposure of training data while ensuring model performance. In the experiments, PKDGAN is found to have consistently good performance on various datasets (MNIST, SVHN, CIFAR-10, and Market-1501). When compared to prior works [1], [2], PKDGAN exhibits 5-82% accuracy loss improvement without compromising any privacy guarantee.

On the Utility Recovery Incapability of Neural Net-based Differential Private Tabular Training Data Synthesizer under Privacy Deregulation

PKDGAN: Private Knowledge Distillation with Generative Adversarial Networks

Private Knowledge Transfer via Model Distillation with Generative Adversarial Networks

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection.

Quantifying and Mitigating Privacy Risks for Tabular Generative Models

Training generative models from privatized data

Graphical vs. Deep Generative Models: Measuring the Impact of Differentially Private Mechanisms and Budgets on Utility

Generating tabular datasets under differential privacy

GANobfuscator: Mitigating Information Leakage under GAN Via Differential Privacy

PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy

RDP-GAN: A Rényi-Differential Privacy Based Generative Adversarial Network

CTAB-GAN+: enhancing tabular data synthesis

Effective and Privacy preserving Tabular Data Synthesizing

Differentially Private Generative Adversarial Network

Improving Correlation Capture in Generating Imbalanced Data using Differentially Private Conditional GANs

PPGAN: Privacy-preserving Generative Adversarial Network

DPWGAN: High-Quality Load Profiles Synthesis with Differential Privacy Guarantees.

A self-attention-based differentially private tabular GAN with high data utility

Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning

Assessment of Differentially Private Synthetic Data for Utility and Fairness in End-to-End Machine Learning Pipelines for Tabular Data

Context-Aware Generative Adversarial Privacy