Abstract:Differential privacy (DP), as a rigorous mathematical definition quantifying privacy leakage, has become a well-accepted standard for privacy protection. Combined with powerful machine learning (ML) techniques, differentially private machine learning (DPML) is increasingly important. As the most classic DPML algorithm, DP-SGD incurs a significant loss of utility, which hinders DPML's deployment in practice. Many studies have recently proposed improved algorithms based on DP-SGD to mitigate utility loss. However, these studies are isolated and cannot comprehensively measure the performance of improvements proposed in algorithms. More importantly, there is a lack of comprehensive research to compare improvements in these DPML algorithms across utility, defensive capabilities, and generalizability. We fill this gap by performing a holistic measurement of improved DPML algorithms on utility and defense capability against membership inference attacks (MIAs) on image classification tasks. We first present a taxonomy of where improvements are located in the ML life cycle. Based on our taxonomy, we jointly perform an extensive measurement study of the improved DPML algorithms, over twelve algorithms, four model architectures, four datasets, two attacks, and various privacy budget configurations. We also cover state-of-the-art label differential privacy (Label DP) algorithms in the evaluation. According to our empirical results, DP can effectively defend against MIAs, and sensitivity-bounding techniques such as per-sample gradient clipping play an important role in defense. We also explore some improvements that can maintain model utility and defend against MIAs more effectively. Experiments show that Label DP algorithms achieve less utility loss but are fragile to MIAs. ML practitioners may benefit from these evaluations to select appropriate algorithms. To support our evaluation, we implement a modular re-usable software, DPMLBench,(1) which enables sensitive data owners to deploy DPML algorithms and serves as a benchmark tool for researchers and practitioners.

Differential Privacy-preserving Distributed Machine Learning

Privacy-preserving Distributed Machine Learning Via Local Randomization and ADMM Perturbation

DPMLBench: Holistic Evaluation of Differentially Private Machine Learning

Differentially Private Robust ADMM for Distributed Machine Learning.

Privacy preserving distributed machine learning with federated learning

Locally Differentially Private Distributed Online Learning with Guaranteed Optimality

Utility–Privacy Trade-Off in Distributed Machine Learning Systems

A(DP)$^2$SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent with Differential Privacy

Efficient Privacy-Preserving Machine Learning for Blockchain Network

Differentially Private Support Vector Machines with Knowledge Aggregation

Secure and Differentially Private Bayesian Learning on Distributed Data

Insuring against the perils in distributed learning: privacy-preserving empirical risk minimization

Distributed Machine Learning Oriented Data Integrity Verification Scheme in Cloud Computing Environment

DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing

Privacy-preserving Decentralized Deep Learning with Multiparty Homomorphic Encryption

Scalable Differential Privacy Mechanisms for Real-Time Machine Learning Applications

Differentially-Private Distributed Model Predictive Control of Linear Discrete-Time Systems with Global Constraints

An Adaptive and Fast Convergent Approach to Differentially Private Deep Learning

Privacy-Preserving Multiparty Learning For Logistic Regression

Privacy-Accuracy Trade-Off in Differentially-Private Distributed Classification: A Game Theoretical Approach