Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures

Yicheng Zhang,Zhen Qin,Zhaomin Wu,Shuiguang Deng
2024-11-28
Abstract:A large amount of instructional text data is essential to enhance the performance of pre-trained large language models (LLMs) for downstream tasks. This data can contain sensitive information and therefore cannot be shared in practice, resulting in data silos that limit the effectiveness of LLMs on various tasks. Federated learning (FL) enables collaborative fine-tuning across different clients without sharing their data. Nonetheless, in practice, this instructional text data is highly heterogeneous in both quantity and distribution across clients, necessitating distinct model structures to best accommodate the variations. However, existing federated fine-tuning approaches either enforce the same model structure or rely on predefined ad-hoc architectures unaware of data distribution, resulting in suboptimal performance. To address this challenge, we propose FedAMoLE, a lightweight personalized federated fine-tuning framework that leverages data-driven heterogeneous model architectures. FedAMoLE introduces the Adaptive Mixture of LoRA Experts (AMoLE) module, which facilitates model heterogeneity with minimal communication overhead by allocating varying numbers of LoRA-based domain experts to each client. Furthermore, we develop a reverse selection-based expert assignment (RSEA) strategy, which enables data-driven model architecture adjustment during fine-tuning by allowing domain experts to select clients that best align with their knowledge domains. Extensive experiments across six different scenarios of data heterogeneity demonstrate that FedAMoLE significantly outperforms existing methods for federated LLM fine-tuning, achieving superior accuracy while maintaining good scalability.
Machine Learning
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are: 1. **Data silo problem**: Large - scale language models (LLMs) require a large amount of instructional text data to improve performance when performing downstream tasks. However, this data may contain sensitive information and thus cannot be directly shared, resulting in the phenomenon of data silos, which limits the effectiveness of LLMs on various tasks. 2. **Problem of personalized model adaptability under heterogeneous data distribution**: In practical applications, the data of different clients is highly heterogeneous in quantity and distribution. Existing federated fine - tuning methods either force the use of the same model structure or rely on predefined architectures that are not sensitive to data distribution, which leads to sub - optimal performance. 3. **Limitations of existing methods**: - **Limited support for model heterogeneity**: In highly heterogeneous scenarios, the amount of data of some clients is very limited, and different - sized models need to be used to avoid over - fitting. However, existing federated fine - tuning methods are often difficult to effectively support model heterogeneity. - **Data - unaware model structure**: Mainstream federated fine - tuning methods mainly rely on manually predefined model architectures. These architectures usually only consider local resource constraints or local data characteristics and fail to comprehensively consider cross - client data features, resulting in poor performance in statistically heterogeneous data environments. To solve these problems, the paper proposes FedAMoLE, a lightweight personalized federated fine - tuning framework. By introducing the Adaptive Mixture of LoRA Experts (AMoLE) module and supporting the data - driven model architecture adjustment strategy (RSEA), it realizes model heterogeneity and data - aware model architecture design with low communication overhead. Specifically, FedAMoLE aims to achieve the following goals: - Support model heterogeneity at the architecture level to better adapt to the data distribution of different clients. - Customize a personalized model architecture for each client by jointly considering local and global data features. - Achieve the above goals with a reasonable additional communication overhead. Through these improvements, FedAMoLE significantly improves the federated LLM fine - tuning performance in heterogeneous data environments while maintaining good scalability.