SA-FedLora: Adaptive Parameter Allocation for Efficient Federated Learning with LoRA Tuning

Yuning Yang,Xiaohong Liu,Tianrun Gao,Xiaodong Xu,Guangyu Wang
2024-05-15
Abstract:Fine-tuning large-scale pre-trained models via transfer learning is an emerging important paradigm for a wide range of downstream tasks, with performance heavily reliant on extensive data. Federated learning (FL), as a distributed framework, provides a secure solution to train models on local datasets while safeguarding raw sensitive data. However, FL networks encounter high communication costs due to the massive parameters of large-scale pre-trained models, necessitating parameter-efficient methods. Notably, parameter efficient fine tuning, such as Low-Rank Adaptation (LoRA), has shown remarkable success in fine-tuning pre-trained models. However, prior research indicates that the fixed parameter budget may be prone to the overfitting or slower convergence. To address this challenge, we propose a Simulated Annealing-based Federated Learning with LoRA tuning (SA-FedLoRA) approach by reducing trainable parameters. Specifically, SA-FedLoRA comprises two stages: initiating and annealing. (1) In the initiating stage, we implement a parameter regularization approach during the early rounds of aggregation, aiming to mitigate client drift and accelerate the convergence for the subsequent tuning. (2) In the annealing stage, we allocate higher parameter budget during the early 'heating' phase and then gradually shrink the budget until the 'cooling' phase. This strategy not only facilitates convergence to the global optimum but also reduces communication costs. Experimental results demonstrate that SA-FedLoRA is an efficient FL, achieving superior performance to FedAvg and significantly reducing communication parameters by up to 93.62%.
Machine Learning,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in federated learning (FL), the high communication cost problem encountered during the fine - tuning process of large - scale pre - trained models. Specifically: 1. **High communication cost**: Since large - scale pre - trained models contain a large number of parameters, when updating the model in the federated learning framework, these parameters need to be frequently transmitted between the client and the server, resulting in huge communication overhead. 2. **Limitations of fixed parameter budgets**: Although existing parameter - efficient fine - tuning methods (such as LoRA) reduce the number of trainable parameters, the fixed parameter budget may lead to over - fitting or slow convergence problems. To solve these problems, the authors propose the Simulated Annealing - based Federated Learning with LoRA tuning (SA - FedLora) method. By dynamically allocating the parameter budget and adaptively adjusting the rank of LoRA, it accelerates convergence and reduces communication costs while maintaining model performance. ### Key points of the solution - **Two - stage training**: - **Initiating Stage**: Parameter regularization is introduced to align the global and local optimal solutions, alleviate the federated client drift problem, and accelerate the convergence in the subsequent annealing stage. - **Annealing Stage**: A higher parameter budget (high - rank LoRA) is allocated in the early heating stage, and then the parameter budget is gradually reduced (low - rank LoRA) to prevent over - fitting and reduce communication costs. - **Parameter Scheduler**: By defining a parameter scheduler to adaptively adjust the rank of LoRA, including three scheduling strategies: cubic, linear, and cosine, to optimize parameter allocation. ### Experimental results The experimental results show that SA - FedLora significantly reduces communication costs (up to 93.62%) on CIFAR - 10 and the medical face dataset, and performs excellently in terms of accuracy, even outperforming the traditional FedAvg method. ### Summary SA - FedLora effectively solves the high communication cost problem during the fine - tuning of large - scale pre - trained models in federated learning by introducing the simulated annealing mechanism and dynamically adjusting the rank of LoRA, and improves the convergence speed and performance of the model.