Differentially private and decentralized randomized power method

Julien Nicolas,César Sabater,Mohamed Maouche,Sonia Ben Mokhtar,Mark Coates
2024-11-04
Abstract:The randomized power method has gained significant interest due to its simplicity and efficient handling of large-scale spectral analysis and recommendation tasks. As modern datasets contain sensitive private information, we need to give formal guarantees on the possible privacy leaks caused by this method. This paper focuses on enhancing privacy preserving variants of the method. We propose a strategy to reduce the variance of the noise introduced to achieve Differential Privacy (DP). We also adapt the method to a decentralized framework with a low computational and communication overhead, while preserving the accuracy. We leverage Secure Aggregation (a form of Multi-Party Computation) to allow the algorithm to perform computations using data distributed among multiple users or devices, without revealing individual data. We show that it is possible to use a noise scale in the decentralized setting that is similar to the one in the centralized setting. We improve upon existing convergence bounds for both the centralized and decentralized versions. The proposed method is especially relevant for decentralized applications such as distributed recommender systems, where privacy concerns are paramount.
Machine Learning,Cryptography and Security,Numerical Analysis
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the lack of privacy protection in the existing Randomized Power Method when dealing with large - scale spectral analysis and recommendation tasks. Specifically: 1. **Privacy Leakage Problem**: Modern datasets usually contain sensitive personal information, and the traditional Randomized Power Method cannot provide sufficient privacy protection. Therefore, a formal privacy - protection mechanism needs to be introduced to ensure that the algorithm output does not leak too much individual information. 2. **Noise Introduction and Performance Balance**: In order to achieve Differential Privacy (DP), noise needs to be introduced into the calculation process. However, the introduction of noise in existing methods will lead to a large variance, thus affecting the accuracy and efficiency of the algorithm. This paper proposes a strategy to reduce the noise variance introduced to achieve differential privacy, so as to improve the algorithm performance. 3. **Application in Decentralized Environments**: In decentralized environments, such as distributed recommendation systems, data is distributed among multiple users or devices, and communication is limited by a predefined communication graph. Most of the existing methods rely on a centralized setting, that is, assuming that there is a trusted manager holding all data, which is not always feasible in practical applications. This paper proposes a method suitable for the decentralized framework, and performs calculations without exposing personal data through techniques such as Secure Aggregation. 4. **Convergence Analysis**: Existing methods have limitations in terms of the number of singular vectors calculated, data distribution assumptions, etc., which lead to limitations in their performance and privacy protection. This paper improves the convergence bound, reduces the sensitivity to the number of singular vectors, and provides new privacy proofs, correcting the errors in previous works. In summary, this paper aims to improve the privacy and computational efficiency of the Randomized Power Method when dealing with large - scale data by introducing an improved privacy - protection mechanism and a method suitable for the decentralized environment.