Public Data Assisted Differential Private Deep Learning

Jiaxi Yang,Xiang Cheng
DOI: https://doi.org/10.1109/ijcnn55064.2022.9892712
2022-01-01
Abstract:Deep neural networks are capable of making classification or prediction using large amounts of labeled data. When there's no or only a few labeled data available, domain adaptation can be adopted to transfer a model from the source domain where many labeled data are available to the target domain with few labeled data. In practice, in addition to private data, some public data are generally available and can be utilized as source domain. However, due to the exposure of the training model, the information of the private data may be potentially compromised. In this paper, we propose public data assisted differentially private deep learning methods which generalize well across different distributions of the private and public data. Specifically, two different methods are presented for different scenarios: unsupervised learning and supervised learning. The proposed methods can leverage the public data for higher performance while providing privacy guarantee for the private dataset. Empirical study on standard benchmark datasets validates the superiority of our approaches.
What problem does this paper attempt to address?