Abstract:Federated learning is a distributed machine learning framework. It facilitates collaborative modeling among participants without sharing raw user data, providing a feasible solution to address data silos in a secure and privacy-preserving manner. However, data heterogeneity is a major challenge of federated learning as it greatly impacts both the convergence speed during model training and the accuracy of predictions. To address this issue, recent federated learning algorithms have incorporated knowledge distillation as a approach of information sharing. Nevertheless, the majority of current approaches only employ logits averaging to combine participants' knowledge, which can have a detrimental impact on the accuracy of the global model, particularly when certain local models show poor performance. besides, some methods rely on public datasets, thereby compromising the privacy-preserving principle of federated learning. To address the aforementioned concerns, we introduces a new federated learning algorithm named Federated Adaptive Knowledge Distillation (FedAdKD), which utilizes knowledge distillation. After the server completes the aggregation of the global model, FedAdKD dynamically allocates distillation weights based on the individual local model's loss on the public dataset, and performs knowledge distillation on the global model. As the use of public datasets violates the privacy-preserving principle of federated learning, we also proposes the incorporation of generative models to generate data that adheres to the original image distribution. By employing federated learning on the client side to train diffusion models, data is generated that adheres to the original image distribution while maintaining privacy. This generated data is then utilized as the distillation dataset. Experimental findings validate the effectiveness of FedAdKD in addressing the obstacles presented by data heterogeneity. FedAdKD not only mitigates the decline in global model accuracy caused by subpar local models and minimizes knowledge forgetting resulting from direct model aggregation but also improves the generalization capacity of the global model.

To Distill or Not To Distill: Towards Fast, Accurate and Communication Efficient Federated Distillation Learning

FedDGP: Disentangling Global and Personal Models for Federated Learning

Communication-Efficient Federated Distillation with Active Data Sampling

Federated Distillation: A Survey

Towards Secure and Robust Federated Distillation in Distributed Cloud: Challenges and Design Issues

FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning

Federated Virtual Learning on Heterogeneous Data with Local-global Distillation

DistDD: Distributed Data Distillation Aggregation through Gradient Matching

Improving Communication Efficiency of Federated Distillation via Accumulating Local Updates

Bidirectional Decoupled Distillation for Heterogeneous Federated Learning

Fedadkd:heterogeneous federated learning via adaptive knowledge distillation

FedKD: Communication Efficient Federated Learning Via Knowledge Distillation

One-shot Federated Learning via Synthetic Distiller-Distillate Communication

DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning

FedDistill: Global Model Distillation for Local Model De-Biasing in Non-IID Federated Learning

FedRAD: Heterogeneous Federated Learning via Relational Adaptive Distillation

Global prototype distillation for heterogeneous federated learning

FedMD: Heterogenous Federated Learning via Model Distillation

Ensemble Distillation for Robust Model Fusion in Federated Learning

Convergence Visualizer of Decentralized Federated Distillation with Reduced Communication Costs

Data-Free Knowledge Distillation for Heterogeneous Federated Learning