Abstract:With powerful parallel computing GPUs and massive user data, neural-network-based deep learning can well exert its strong power in problem modeling and solving, and has archived great success in many applications such as image classification, speech recognition and machine translation etc. While deep learning has been increasingly popular, the problem of privacy leakage becomes more and more urgent. Given the fact that the training data may contain highly sensitive information, e.g., personal medical records, directly sharing them among the users (i.e., participants) or centrally storing them in one single location may pose a considerable threat to user privacy. In this paper, we present a practical privacy-preserving collaborative deep learning system that allows users to cooperatively build a collective deep learning model with data of all participants, without direct data sharing and central data storage. In our system, each participant trains a local model with their own data and only shares model parameters with the others. To further avoid potential privacy leakage from sharing model parameters, we use functional mechanism to perturb the objective function of the neural network in the training process to achieve $\epsilon $ -differential privacy. In particular, for the first time, we consider the existence of unreliable participants , i.e., the participants with low-quality data, and propose a solution to reduce the impact of these participants while protecting their privacy. We evaluate the performance of our system on two well-known real-world datasets for regression and classification tasks. The results demonstrate that the proposed system is robust against unreliable participants, and achieves high accuracy close to the model trained in a traditional centralized manner while ensuring rigorous privacy protection.

Privacy-Preserving Multiparty Learning For Logistic Regression

Privacy-Preserving Collaborative Deep Learning with Unreliable Participants.

Privacy Preserving PCA for Multiparty Modeling

Privacy-preserving multi-party logistic regression in cloud computing

Privacy-preserving two-parties logistic regression on vertically partitioned data using asynchronous gradient sharing

Privacy-preserving logistic regression with secret sharing

Efficient Privacy Preserving Logistic Regression for Horizontally Distributed Data

Guaranteed Distributed Machine Learning: Privacy-preserving Empirical Risk Minimization

Supporting Regularized Logistic Regression Privately and Efficiently

Distributed Logistic Regression with Differential Privacy

Online Efficient Secure Logistic Regression based on Function Secret Sharing

EPoLORE: Efficient and Privacy Preserved Logistic Regression Scheme.

Insuring against the perils in distributed learning: privacy-preserving empirical risk minimization

A privacy-preserving decentralized credit scoring method based on multi-party information

Differential Privacy-preserving Distributed Machine Learning

Transfer Learning for Logistic Regression with Differential Privacy

Privacy-preserving Logistic Regression with Improved Efficiency

Privacy-Preserving Vertical Collaborative Logistic Regression without Trusted Third-Party Coordinator

Privacy-preserving Distributed Machine Learning Via Local Randomization and ADMM Perturbation

Multiparty Dual Learning

PrivColl: Practical Privacy-Preserving Collaborative Machine Learning