FedH2L: A Federated Learning Approach with Model and Statistical Heterogeneity

Yiying Li,Wei Zhou,Haibo Mi,Yijie Wang,Huaimin Wang
DOI: https://doi.org/10.1109/jcc59055.2023.00009
2023-01-01
Abstract:Federated learning (FL) enables distributed participants to collectively learn a strong global model without sacrificing their individual data privacy. Mainstream FL approaches require each participant to share a common network architecture and further assume that data are sampled IID across participants. However, in real-world deployments, participants may require heterogeneous network architectures; and the data distribution is almost non-uniform. To address these issues we introduce FedH2L, which is agnostic to the model architecture and robust to different data distributions across participants. In contrast to approaches sharing parameters or gradients, FedH2L relies on mutual distillation, exchanging only posteriors on a shared seed set between participants in a decentralized manner. This makes it extremely bandwidth efficient, model agnostic, and crucially produces models capable of performing well on the whole data distribution when learning from heterogeneous silos.
What problem does this paper attempt to address?