Abstract: Federated Learning (FL) is a collaborative training paradigm in which a global Machine Learning (ML) model is trained on distributed, typically private data sources without disclosing the raw data. The approach paves the way for stronger privacy guarantees, improved overall system scalability, and sustainability. In this context, Federated Averaging (FedAvg) is a representative FL algorithm that adopts a client–server protocol operating in synchronous rounds: selected learners contribute local model updates trained on their private data, while a server entity aggregates these contributions, producing the new-generation global model as a weighted average of the local ones. However, when clients possess (highly) dissimilar data, FedAvg becomes ineffective because the client models diverge. Consequently, FedAvg-trained models struggle to generalize to unseen data from the global distribution. In this paper, we conduct a systematic review of state-of-the-art approaches proposed to counteract global model performance degradation in the presence of heterogeneous data. To this end, we compile an original taxonomy, highlighting the main algorithmic approaches and mechanisms behind each identified category. Advancing the current body of knowledge, we empirically evaluate the generalization performance of various methods on visual tasks under moderate and significant levels of data heterogeneity, as is common practice within the surveyed literature. In addition, the paper benchmarks hybrid techniques that combine client- and server-side algorithmic tweaks, shedding light on some of the associated performance tradeoffs. We recognize that other relevant issues in FL, such as device heterogeneity and energy consumption, have a non-negligible impact on the learning process; however, these well-investigated topics are not the main focus of this article.
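To make the server-side aggregation step concrete, the following is a minimal sketch of a weighted average over client models. It assumes each client's weight is proportional to its number of local training samples, as in the commonly cited FedAvg formulation. The function name `fedavg_aggregate`, the dictionary-of-lists model representation, and the parameter names are hypothetical choices made for illustration only; they are not taken from the paper.

```python
from typing import Dict, List, Sequence

# Hypothetical model representation: parameter name -> flat list of floats.
Weights = Dict[str, List[float]]


def fedavg_aggregate(client_weights: Sequence[Weights],
                     client_sizes: Sequence[int]) -> Weights:
    """Return the weighted average of client models.

    Assumption: each client's contribution is weighted by its share of the
    total number of local samples (n_k / sum of all n_k).
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client model")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total number of samples must be positive")

    global_weights: Weights = {}
    for name in client_weights[0]:
        # Accumulate the weighted sum for this parameter, element by element.
        acc = [0.0] * len(client_weights[0][name])
        for weights, size in zip(client_weights, client_sizes):
            coeff = size / total
            for i, value in enumerate(weights[name]):
                acc[i] += coeff * value
        global_weights[name] = acc
    return global_weights


# Two clients holding 1 and 3 samples: the second client dominates the average.
clients = [{"w": [0.0, 4.0]}, {"w": [4.0, 0.0]}]
print(fedavg_aggregate(clients, [1, 3]))
```

In this toy case the client with three times as much data pulls the global parameters three times as strongly. This is one way to picture why client models trained on highly dissimilar data can yield an average that represents none of them well.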

Exploiting Feature Heterogeneity for Improved Generalization in Federated Multi-task Learning

Exploiting High-Order Information in Heterogeneous Multi-Task Feature Learning

Federated Multi-Task Learning under a Mixture of Distributions

FedHCA$^2$: Towards Hetero-Client Federated Multi-Task Learning

Learning to Generalize in Heterogeneous Federated Networks

A New Look and Convergence Rate of Federated Multi-Task Learning with Laplacian Regularization

Non-Federated Multi-Task Split Learning for Heterogeneous Sources

Towards Efficient Model-Heterogeneity Federated Learning for Large Models

Towards Addressing Heterogeneity Of Data In Federated Learning

FedDGA: Federated Multi-Task Learning Based on Dynamic Guided Attention

Federated Mutual Learning: a Collaborative Machine Learning Method for Heterogeneous Data, Models, and Objectives

FedL2G: Learning to Guide Local Training in Heterogeneous Federated Learning

Federated Model Heterogeneous Matryoshka Representation Learning

Heterogeneous Federated Learning via Grouped Sequential-to-Parallel Training

Federated Multi-Task Learning with Non-Stationary and Heterogeneous Data in Wireless Networks

Privacy preserving federated learning for full heterogeneity

FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning

FedPer++: Toward Improved Personalized Federated Learning on Heterogeneous and Imbalanced Data

Completely Heterogeneous Federated Learning

Enhancing generalization in federated learning with heterogeneous data: A comparative literature review