Abstract: Federated learning is a distributed machine learning framework. It facilitates collaborative modeling among participants without sharing raw user data, providing a feasible solution to data silos in a secure and privacy-preserving manner. However, data heterogeneity is a major challenge in federated learning, as it greatly impacts both the convergence speed during model training and the accuracy of predictions. To address this issue, recent federated learning algorithms have incorporated knowledge distillation as an approach to information sharing. Nevertheless, most current approaches only average logits to combine participants' knowledge, which can degrade the accuracy of the global model, particularly when certain local models perform poorly. Besides, some methods rely on public datasets, thereby compromising the privacy-preserving principle of federated learning. To address these concerns, we introduce a new federated learning algorithm based on knowledge distillation, named Federated Adaptive Knowledge Distillation (FedAdKD). After the server completes the aggregation of the global model, FedAdKD dynamically allocates distillation weights based on each local model's loss on the public dataset and performs knowledge distillation on the global model. As the use of public datasets violates the privacy-preserving principle of federated learning, we also propose incorporating generative models: diffusion models are trained on the client side via federated learning and then generate data that adheres to the original image distribution while maintaining privacy. This generated data is then used as the distillation dataset. Experimental findings validate the effectiveness of FedAdKD in addressing the obstacles presented by data heterogeneity. FedAdKD not only mitigates the decline in global model accuracy caused by subpar local models and minimizes knowledge forgetting resulting from direct model aggregation, but also improves the generalization capacity of the global model.
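The adaptive weighting step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact weighting rule, so the code below assumes one plausible choice: a softmax over the negative per-client losses, so that local models with lower loss on the distillation dataset receive larger weights. The function names (`adaptive_weights`, `ensemble_logits`, `distillation_loss`), the temperature value, and the example numbers are all hypothetical and are not taken from the paper.

```python
import math


def softmax(values, temperature=1.0):
    """Numerically stable softmax over a list of floats."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_weights(client_losses):
    """Assumed rule: softmax over negative losses (lower loss, larger weight)."""
    return softmax([-loss for loss in client_losses])


def ensemble_logits(client_logits, weights):
    """Weighted combination of per-client logits for a single sample."""
    n_classes = len(client_logits[0])
    return [
        sum(w * logits[c] for w, logits in zip(weights, client_logits))
        for c in range(n_classes)
    ]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) between temperature-softened distributions."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


if __name__ == "__main__":
    # Hypothetical losses of three local models on the distillation dataset;
    # the third model performs poorly and should be down-weighted.
    losses = [0.2, 0.3, 2.5]
    weights = adaptive_weights(losses)

    # Hypothetical logits from each local model for one sample (3 classes).
    client_logits = [[2.0, 0.5, 0.1], [1.8, 0.7, 0.2], [0.1, 0.2, 2.5]]
    teacher = ensemble_logits(client_logits, weights)

    # Hypothetical logits of the aggregated global model for the same sample.
    global_logits = [1.0, 1.0, 1.0]
    print("weights:", weights)
    print("distillation loss:", distillation_loss(global_logits, teacher))
```

With plain logits averaging, every local model would contribute equally (weights of 1/3 each in this example), so the poorly performing third model would pull the teacher signal toward the wrong class. Under the assumed loss-based rule, its contribution shrinks, which is the effect the abstract attributes to the dynamic allocation of distillation weights.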

Global prototype distillation for heterogeneous federated learning

FedDGP: Disentangling Global and Personal Models for Federated Learning

Local-Global Knowledge Distillation in Heterogeneous Federated Learning with Non-IID Data

Understanding the Training Dynamics in Federated Deep Learning via Aggregation Weight Optimization

FedGKD: Towards Heterogeneous Federated Learning via Global Knowledge Distillation

A Prototype-Based Knowledge Distillation Framework for Heterogeneous Federated Learning

Federated Virtual Learning on Heterogeneous Data with Local-global Distillation

FedAdKD: Heterogeneous Federated Learning via Adaptive Knowledge Distillation

FedMD: Heterogenous Federated Learning via Model Distillation

FedDistill: Global Model Distillation for Local Model De-Biasing in Non-IID Federated Learning

Fine-tuning Global Model Via Data-Free Knowledge Distillation for Non-IID Federated Learning

FedTGP: Trainable Global Prototypes with Adaptive-Margin-Enhanced Contrastive Learning for Data and Model Heterogeneity in Federated Learning

Unlocking the Potential of Federated Learning: The Symphony of Dataset Distillation via Deep Generative Latents

Bidirectional Decoupled Distillation for Heterogeneous Federated Learning

The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge Distillation

FedDKD: Federated Learning with Decentralized Knowledge Distillation

Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Towards Personalized Federated Learning via Comprehensive Knowledge Distillation

Tackling Data Heterogeneity in Federated Learning with Class Prototypes