Abstract:Federated learning (FL) aims to raise a multi-client collaboration approach in the case of local data isolation. In particular, the clients with non-IID data frequently participate in or leave the federated learning training process asynchronously, resulting in dynamic federated learning (DFL) scenario, which attracts more and more attention. Indeed, an effective DFL solution has to address the following two challenges: 1) Statistical Dynamics. The distributions of local data from clients are always non-IID and the global data distribution is dynamic due to the participation or departure of clients. 2) Expiration Dynamics. After clients leave the federated training process, their historical updated models have a certain validity to reuse in subsequent rounds but it is hard to quantify this validity. In this paper, we first consider clustering the clients with similar data distribution to make them much closer to IID and concentrating on the training the models in each cluster. Then we analyze the changing trend of model validity named model quality and define one suitable function to describe expiration dynamics. As a solution, we propose Dynamic Clustering Federated Learning (DCFL) framework to improve federated learning on non-IID data in DFL. Specifically, DCFL follows the client-server architecture as the standard FL. On the client side, the local devices calculate the related information of the local data distribution for client clustering. On the server side, we design two strategies for the challenges above. We propose dynamic clustering aggregation strategy (including a dynamic clustering algorithm and a two-stage aggregation) by dynamically clustering clients and then aggregating the local models to overcome Statistical Dynamics. Besides, we propose expiration memory strategy by reusing the historical models and then adjusting to model quality of historical models as the basis for model aggregation to overcome Expiration Dynamics. Finally, we conduct extensive experiments on public datasets, which demonstrate the effectiveness of the DCFL framework.

GradMFL: Gradient Memory-Based Federated Learning for Hierarchical Knowledge Transferring Over Non-IID Data

FedDGP: Disentangling Global and Personal Models for Federated Learning

Hierarchical Federated Learning with Adaptive Clustering on Non-IID Data

A Hierarchical Knowledge Transfer Framework for Heterogeneous Federated Learning.

A Survey of Federated Learning on Non-IID Data

Information-Exchangeable Hierarchical Clustering for Federated Learning with Non-IID Data

Hierarchical Federated Learning with Multi-Timescale Gradient Correction

GradMA: A Gradient-Memory-based Accelerated Federated Learning with Alleviated Catastrophic Forgetting

Dynamic Clustering Federated Learning for Non-IID Data.

FedGK: Communication-Efficient Federated Learning through Group-Guided Knowledge Distillation

HierarchyFL: Heterogeneous Federated Learning via Hierarchical Self-Distillation

A Clustered Federated Learning Method of User Behavior Analysis Based on Non-IID Data

Federated learning on non-IID and long-tailed data via dual-decoupling

Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion

Federated Learning with Non-IID Data: A Survey

Local-Global Knowledge Distillation in Heterogeneous Federated Learning with Non-IID Data

Fine-tuning Global Model Via Data-Free Knowledge Distillation for Non-IID Federated Learning

No One Left Behind: Real-World Federated Class-Incremental Learning

Take History as a Mirror in Heterogeneous Federated Learning

MultiConfederated Learning: Inclusive Non-IID Data handling with Decentralized Federated Learning