Abstract:Federated learning is an emerging research paradigm for enabling collaboratively training deep learning models without sharing patient data. However, the data from different institutions are usually heterogeneous across institutions, which may reduce the performance of models trained using federated learning. In this study, we propose a novel heterogeneity-aware federated learning method, SplitAVG, to overcome the performance drops from data heterogeneity in federated learning. Unlike previous federated methods that require complex heuristic training or hyper parameter tuning, our SplitAVG leverages the simple network split and feature map concatenation strategies to encourage the federated model training an unbiased estimator of the target data distribution. We compare SplitAVG with seven state-of-the-art federated learning methods, using centrally hosted training data as the baseline on a suite of both synthetic and real-world federated datasets. We find that the performance of models trained using all the comparison federated learning methods degraded significantly with the increasing degrees of data heterogeneity. In contrast, SplitAVG method achieves comparable results to the baseline method under all heterogeneous settings, that it achieves 96.2% of the accuracy and 110.4% of the mean absolute error obtained by the baseline in a diabetic retinopathy binary classification dataset and a bone age prediction dataset, respectively, on highly heterogeneous data partitions. We conclude that SplitAVG method can effectively overcome the performance drops from variability in data distributions across institutions. Experimental results also show that SplitAVG can be adapted to different base convolutional neural networks (CNNs) and generalized to various types of medical imaging tasks. The code is publicly available at https://github.com/zm17943/SplitAVG.

An Experimental Study of Data Heterogeneity in Federated Learning Methods for Medical Imaging

On the Impact of Data Heterogeneity in Federated Learning Environments with Application to Healthcare Networks

Federated Learning for Data and Model Heterogeneity in Medical Imaging

A Federated Learning Framework Via Decentralized Data Valuation for Chronic Disease Healthcare

Federated Ophthalmic Disease Diagnosis Based on Weighted Averaging and Local Optimization

SplitAVG: A Heterogeneity-Aware Federated Deep Learning Method for Medical Imaging

Robust Federated Learning for Heterogeneous Model and Data

Tackling Data Heterogeneity in Federated Learning via Loss Decomposition

Tackling heterogeneity in medical federated learning via aligning vision transformers

FedSLD: Federated Learning with Shared Label Distribution for Medical Image Classification

FDQM: Four-Dimensional Quantitative Measure for Statistical Heterogeneity in Federated Learning

Towards Addressing Heterogeneity Of Data In Federated Learning

Federated Learning for Medical Image Analysis: A Survey

Medical Federated Model with Mixture of Personalized and Sharing Components

Low prevalence of expression of p53 oncoprotein in oral carcinomas from Sri Lanka associated with betel and tobacco chewing.

Dealing With Heterogeneous 3D MR Knee Images: A Federated Few-Shot Learning Method With Dual Knowledge Distillation

Privacy preserving federated learning for full heterogeneity

Adaptive Personlization in Federated Learning for Highly Non-i.i.d. Data

Towards Fair and Privacy Preserving Federated Learning for the Healthcare Domain

Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data

A New Perspective to Boost Performance Fairness for Medical Federated Learning