Robust Federated Learning Based on Metrics Learning and Unsupervised Clustering for Malicious Data Detection

Jiaming Li, Xinyue Zhang, Liang Zhao
DOI: https://doi.org/10.1145/3476883.3520221
2022-01-01
Abstract: Federated learning has emerged as a new paradigm for improving communication efficiency and data privacy in various machine learning tasks. It allows distributed devices to train a model collaboratively using only their local datasets. However, correctly labeled training data is a precondition for generating a high-quality model, and real-world scenarios usually cannot guarantee it. Conventional countermeasures mainly detect corrupt local updates and exclude them from the global weight aggregation phase to mitigate the impact of malicious data. Instead of discarding clients' weight updates, we propose a novel robust federated learning method that uses Metrics Learning to encode the local data and the unsupervised clustering method K-means to exclude malicious data during local training. As a result, correctly labeled data still contribute to the global weight update, so the global model tends to be more general. We evaluate the proposed method on two public image classification datasets, Fashion-MNIST and CIFAR-10. The simulation results demonstrate that the proposed scheme is robust when performing federated learning in the presence of malicious data.
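The abstract does not spell out how the clustering step filters local data, so the following is only a minimal sketch under stated assumptions, not the authors' procedure. It assumes that embeddings have already been produced by some metric-learning encoder (not shown), that each class's embeddings are split into two clusters with plain K-means, and that the smaller cluster is treated as suspect and dropped before local training. The function names `two_means` and `filter_suspect_samples`, and the toy data, are hypothetical.

```python
import math
import random


def two_means(points, iters=20, seed=0):
    """Plain K-means with k=2 on a list of equal-length float tuples."""
    rng = random.Random(seed)
    # Assumed initialisation: one random centre, plus the point farthest from it.
    c0 = points[rng.randrange(len(points))]
    c1 = max(points, key=lambda p: math.dist(p, c0))
    centres = [c0, c1]
    assign = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest centre.
        assign = [
            0 if math.dist(p, centres[0]) <= math.dist(p, centres[1]) else 1
            for p in points
        ]
        # Update step: each centre becomes the mean of its members.
        for k in (0, 1):
            members = [p for p, a in zip(points, assign) if a == k]
            if members:
                centres[k] = tuple(sum(col) / len(members) for col in zip(*members))
    return assign


def filter_suspect_samples(embeddings, labels):
    """Return sorted indices kept after dropping the minority cluster per label."""
    keep = []
    for label in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == label]
        if len(idx) < 3:
            # Too few samples to cluster meaningfully; keep them all.
            keep.extend(idx)
            continue
        assign = two_means([embeddings[i] for i in idx])
        # Assumption: the larger cluster holds the correctly labeled samples.
        majority = 1 if sum(assign) * 2 > len(assign) else 0
        keep.extend(i for i, a in zip(idx, assign) if a == majority)
    return sorted(keep)


# Toy 2-D "embeddings": indices 4 and 8 carry flipped labels.
embeddings = [
    (0.0, 0.1), (0.1, 0.0), (0.05, 0.05), (0.0, 0.0), (5.0, 5.0),
    (5.0, 5.1), (5.1, 5.0), (4.9, 5.0), (0.1, 0.1),
]
labels = [0, 0, 0, 0, 0, 1, 1, 1, 1]
kept = filter_suspect_samples(embeddings, labels)
print(kept)  # [0, 1, 2, 3, 5, 6, 7]
```

One limitation of this sketch is that a client with entirely clean data would still lose its smaller cluster; a practical scheme would likely need a separation threshold or similar check before discarding samples.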