Efficient Federated Learning for Modern NLP

Dongqi Cai, Yaozong Wu, Shangguang Wang, Felix Xiaozhu Lin, Mengwei Xu
DOI: https://doi.org/10.1145/3603165.3607380
2023-01-01
Abstract: Fine-tuning pre-trained models for downstream tasks often requires private data, for which federated learning is the de facto approach (i.e., FedNLP). However, FedNLP is prohibitively slow because of large model sizes and the resulting high network and computation costs. Towards practical FedNLP, we identify adapters, small bottleneck modules inserted at a variety of model layers, as the key building blocks. To automate adapter configuration, we propose FedAdapter, a framework that enhances existing FedNLP with progressive training and sideline trial. Extensive experiments show that FedAdapter can reduce FedNLP’s model convergence delay to no more than several hours.
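To make the "small bottleneck module" idea concrete, the following is a minimal sketch of a commonly used adapter form: a down-projection, a nonlinearity, an up-projection, and a residual connection. It is an illustration only and may differ from the adapter design FedAdapter actually uses. The function names, the ReLU nonlinearity, and the bottleneck width of 64 are assumptions chosen for the example; 768 is the hidden size of BERT-base.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def adapter_param_count(hidden_size: int, bottleneck_size: int) -> int:
    """Trainable parameters in one adapter (weights plus biases)."""
    down = hidden_size * bottleneck_size + bottleneck_size  # down-projection
    up = bottleneck_size * hidden_size + hidden_size        # up-projection
    return down + up


def adapter_forward(x: Vector, w_down: Matrix, b_down: Vector,
                    w_up: Matrix, b_up: Vector) -> Vector:
    """Apply a bottleneck adapter to one hidden vector.

    Shapes: w_down is hidden x bottleneck, w_up is bottleneck x hidden.
    """
    # Project down to the bottleneck width, then apply ReLU (assumed nonlinearity).
    h = [
        max(0.0, sum(x[i] * w_down[i][j] for i in range(len(x))) + b_down[j])
        for j in range(len(b_down))
    ]
    # Project back up to the hidden width.
    out = [
        sum(h[j] * w_up[j][k] for j in range(len(h))) + b_up[k]
        for k in range(len(b_up))
    ]
    # Residual connection: the adapter only adds a correction to its input.
    return [xi + oi for xi, oi in zip(x, out)]


if __name__ == "__main__":
    # Hypothetical sizes: hidden width 768, bottleneck width 64.
    print(adapter_param_count(768, 64))
    print(adapter_forward([1.0, 2.0], [[1.0], [1.0]], [0.0],
                          [[0.5, -0.5]], [0.0, 0.0]))
```

Under these assumptions, only the adapter parameters would need to be trained and exchanged between clients and server while the pre-trained weights stay frozen, which is why adapters are attractive when network cost dominates.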