Toward the Application of Differential Privacy to Data Collaboration

Hiromi Yamashiro,Kazumasa Omote,Akira Imakura,Tetsuya Sakurai
DOI: https://doi.org/10.1109/access.2024.3396146
IF: 3.9
2024-05-10
IEEE Access
Abstract:Federated Learning, a model-sharing method, and Data Collaboration, a non-model-sharing method, are recognized as data analysis methods for distributed data. In Federated Learning, clients send only the parameters of a machine learning model to the central server. In Data Collaboration, clients send data that has undergone irreversibly transformed through dimensionality reduction to the central server. Both methods are designed with privacy concerns, but privacy is not guaranteed. Differential Privacy, a theoretical and quantitative privacy criterion, has been applied to Federated Learning to achieve rigorous privacy preservation. In this paper, we introduce a novel method using PCA (Principal Component Analysis) that finds low-rank approximation of a matrix preserving the variance, aiming to apply Differential Privacy to Data Collaboration. Experimental evaluation using the proposed method show that differentially-private Data Collaboration achieves comparable performance to differentially-private Federated Learning.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?