Personalized Collaborative Fine-Tuning for On-Device Large Language Models

Nicolas Wagner,Dongyang Fan,Martin Jaggi
2024-08-07
Abstract:We explore on-device self-supervised collaborative fine-tuning of large language models with limited local data availability. Taking inspiration from the collaborative learning community, we introduce three distinct trust-weighted gradient aggregation schemes: weight similarity-based, prediction similarity-based and validation performance-based. To minimize communication overhead, we integrate Low-Rank Adaptation (LoRA) and only exchange LoRA weight updates. Our protocols, driven by prediction and performance metrics, surpass both FedAvg and local fine-tuning methods, which is particularly evident in realistic scenarios with more diverse local data distributions. The results underscore the effectiveness of our approach in addressing heterogeneity and scarcity within local datasets.
Computation and Language,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the personalized collaborative fine - tuning of large language models (LLMs) on devices, especially in the case of limited local data. Specifically, the paper explores how to improve the personalized performance of large language models through user - to - user collaboration by leveraging limited local data resources while protecting user privacy. Since the local data distribution of each user may be heterogeneous and scarce, traditional local fine - tuning methods may not be effective. Therefore, new collaborative strategies need to be developed to overcome these challenges. The paper proposes three trust - weighted gradient aggregation schemes: weight - similarity - based, prediction - similarity - based, and verification - performance - based. These schemes aim to improve the effectiveness of collaborative fine - tuning by reducing communication overhead (such as only exchanging low - rank adaptation (LoRA) weight updates) and optimizing the gradient aggregation process. Experimental results show that the proposed collaborative protocol is superior to FedAvg and local fine - tuning methods in dealing with data heterogeneity and scarcity, especially when there is a more diverse local data distribution in real - world scenarios. In conclusion, the goal of this research is to explore the possibility of personalized collaborative fine - tuning on devices to overcome local data limitations and improve the personalized performance of large language models in different user environments.