Rényi Differential Privacy of the Sampled Gaussian Mechanism

Ilya Mironov,Kunal Talwar,Li Zhang
DOI: https://doi.org/10.48550/arXiv.1908.10530
2019-08-28
Abstract:The Sampled Gaussian Mechanism (SGM)---a composition of subsampling and the additive Gaussian noise---has been successfully used in a number of machine learning applications. The mechanism's unexpected power is derived from privacy amplification by sampling where the privacy cost of a single evaluation diminishes quadratically, rather than linearly, with the sampling rate. Characterizing the precise privacy properties of SGM motivated development of several relaxations of the notion of differential privacy. This work unifies and fills in gaps in published results on SGM. We describe a numerically stable procedure for precise computation of SGM's Rényi Differential Privacy and prove a nearly tight (within a small constant factor) closed-form bound.
Machine Learning,Cryptography and Security
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to accurately characterize the Rényi differential privacy (RDP) properties of the Sampled Gaussian Mechanism (SGM). Specifically, SGM combines subsampling and additive Gaussian noise and is widely used in machine learning. However, an accurate description of its privacy properties has always been a challenge. ### Main problems of the paper 1. **Accurately calculate the Rényi differential privacy of SGM**: Existing research has certain limitations when dealing with the privacy properties of SGM, especially in terms of numerical stability and theoretical analysis. This paper aims to provide a numerically stable calculation method and derive an almost tight closed - form upper bound. 2. **Unify and fill the gaps in existing results**: Previous research on SGM has used different relaxed versions of the differential privacy definition, such as Concentrated Differential Privacy (CDP), zero - Concentrated Differential Privacy (zero - CDP), etc. This paper attempts to unify these results and fill the gaps among them. ### Specific contributions - **Numerically stable process**: Proposed a numerically stable process to accurately calculate the Rényi differential privacy of SGM. - **Almost tight closed - form upper bound**: Proved an almost tight closed - form upper bound, which is optimal within a small constant factor. - **Theoretical analysis and comparison**: Through theoretical analysis, compared different versions of the differential privacy definition (such as CDP, zCDP, tCDP, and RDP), and showed the relationships between them. ### Key formulas The key formulas involved in the paper include: 1. **Rényi divergence**: \[ D_\alpha(P \| Q)=\frac{1}{\alpha - 1}\ln\int_X q(x)\left(\frac{p(x)}{q(x)}\right)^\alpha dx \] where \( p(x) \) and \( q(x) \) are the density functions of distributions \( P \) and \( Q \), respectively. 2. **Rényi differential privacy (RDP)**: \[ D_\alpha(M(S)\|M(S'))\leq\epsilon \] for any two adjacent datasets \( S \) and \( S' \), where \( S' = S\cup\{x\} \) or vice versa. 3. **Sampling Gaussian mechanism (SGM)**: \[ SG_{q,\sigma}(S)=f(\{x:x\in S \text{ is sampled with probability } q\})+N(0,\sigma^2 I_d) \] where each element is independently sampled with probability \( q \) and \( d \)-dimensional spherical Gaussian noise \( N(0,\sigma^2 I_d) \) is added. ### Conclusion Through the above methods, the paper provides more accurate and stable privacy analysis tools for SGM, which helps to better understand and control privacy costs in practical applications.