To Distill or Not To Distill: Towards Fast, Accurate and Communication Efficient Federated Distillation Learning

Yuan Zhang,Wenlong Zhang,Lingjun Pu,Tao Lin,Jinyao Yan
DOI: https://doi.org/10.1109/jiot.2023.3324666
IF: 10.6
2023-01-01
IEEE Internet of Things Journal
Abstract:Apart from the promising potential, Federated Learning (FL) faces challenges such as high communication costs and client heterogeneity. Although numerous works have been proposed to address these issues, they lack a holistic perspective to balance all requirements. Moreover, these solutions have not fully utilized the underlying computation capability and network resources, resulting in sub-optimal trade-offs between communication efficiency and inference accuracy. To overcome these challenges, we propose FDL: a Federated Distillation Learning framework that combines Federated Distillation (FD) and Federated Learning (FL) to fully utilize computation and network resources. We theoretically prove the convergence bound of the proposed FDL framework. Furthermore, to minimize the training time while maintaining inference accuracy, we design HAD: a Heterogeneity-Aware FL/FD selection algorithm that determines the total communication rounds and selects the set of FL and FD nodes in each communication round. The optimality of HAD is also theoretically proved. The FDL framework and HAD algorithm together minimize the training time while satisfying the inference accuracy in a heterogeneous and dynamic environment. Extensive experiments on various learning algorithms and datasets show that the proposed FDL-HAD solution can obtain the optimal selection decision in overwhelmingly less selection time compared with Gurobi solver and can reduce the overall training time by at least 44.8% compared with FL solutions with the same inference accuracy.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?