Distributed Modelling Approaches for Data Privacy Preserving

Chao Wu, Fengda Zhang, Fei Wu
DOI: https://doi.org/10.1109/BigMM.2019.00016
2019-01-01
Abstract: Machine learning has been developing rapidly in recent years, and data plays a central role in it. However, it is hard to make full use of the data held by a large number of nodes to collaboratively train a good model while preserving data privacy. In this paper, we study and analyze several decentralized machine learning algorithms with regard to privacy protection, and propose a smart contract-based decentralized federated learning algorithm. We also propose a decentralized topology-based machine learning algorithm to solve the problems caused by a star-topology network. Building on it, we further present a novel distillation-based method of model aggregation that lifts the conventional constraint of federated learning that the models on different nodes must share the same network structure. We also use several methods to generate synthetic datasets from raw datasets, so that models can be trained while the privacy of the raw data is protected. Finally, we analyze and compare different distributed machine learning algorithms through experiments.
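To illustrate why distillation can remove the same-structure constraint, here is a minimal sketch, not the authors' algorithm. It assumes each node scores a shared set of samples and that aggregation averages the resulting class probabilities instead of the model weights; the function names, the shared sample set, and the logit values are all hypothetical. Because only outputs are combined, the nodes' models may have entirely different architectures.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert a logit vector into class probabilities (numerically stable)."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_soft_labels(node_logits, temperature=1.0):
    """Average per-node class probabilities on a shared set of samples.

    node_logits[k][i] is the logit vector produced by node k for shared
    sample i. Only outputs are combined, so the nodes' models do not need
    to share a network structure (assumption of this sketch).
    """
    num_nodes = len(node_logits)
    num_samples = len(node_logits[0])
    soft_labels = []
    for i in range(num_samples):
        probs = [softmax(node_logits[k][i], temperature) for k in range(num_nodes)]
        num_classes = len(probs[0])
        soft_labels.append(
            [sum(p[c] for p in probs) / num_nodes for c in range(num_classes)]
        )
    return soft_labels


# Hypothetical logits from two nodes on two shared samples (two classes).
node_logits = [
    [[2.0, 0.0], [0.0, 1.0]],  # node 0
    [[1.0, 1.0], [0.0, 3.0]],  # node 1
]
soft_labels = aggregate_soft_labels(node_logits)
```

In a setup like this, each node would then train its own model against `soft_labels` as distillation targets; that training step is omitted here.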