Defending against gradient inversion attacks in federated learning via statistical machine unlearning

Kun Gao,Tianqing Zhu,Dayong Ye,Wanlei Zhou
DOI: https://doi.org/10.1016/j.knosys.2024.111983
IF: 8.139
2024-05-29
Knowledge-Based Systems
Abstract:Federated learning (FL) has been used as a promising approach to breaking the dilemma between the data privacy and the learning from large collections of distributed data. Without sharing data, the server trains a global model with high accuracy only by accessing the gradient of each client. However, recent research has revealed that FL is vulnerable to gradient inversion attacks, where the server can reconstruct clients' training data by inverting their uploaded gradients. Current defense methods share the common limitation that they need to either perturb the training data or the gradients. However, this perturbation will inevitably reduce the accuracy of the global model. To overcome these limitations, this paper proposes a defense method by allowing clients to train local models without using training data but rather using the statistical information of the data. Inspired by the statistical machine unlearning method, which uses statistical information to train models, combined with knowledge distillation to lightweight local models, our method further isolates the relationship between gradients and training data by transferring the model to a simpler model. Knowledge distillation typically involves extracting knowledge from a teacher model and transferring it to a student model without the need for the original training data. With this method, on the one hand, the server can reconstruct only the statistical information of the student model data, which is semantically meaningless to humans. On the other hand, using the statistical information of the data to train the global model can guarantee the model's accuracy. Experimental results show that our method outperforms existing defense methods in various aspects, including reconstruction accuracy, model accuracy and training efficiency.
computer science, artificial intelligence
What problem does this paper attempt to address?