Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning

Wenhan Xia, Chengwei Qin, Elad Hazan
2024-01-08
Abstract: Fine-tuning is the primary methodology for tailoring pre-trained large language models to specific tasks. As the model's scale and the diversity of tasks expand, parameter-efficient fine-tuning methods are of paramount importance. One of the most widely used families of methods is low-rank adaptation (LoRA) and its variants. LoRA encodes the weight update as the product of two low-rank matrices. Despite its advantages, LoRA falls short of full-parameter fine-tuning in terms of generalization error for certain tasks.
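
To make the low-rank encoding described in the abstract concrete, here is a minimal sketch in plain Python. It is an illustration only, not the authors' implementation: the helper names (`matmul`, `lora_update`, `apply_update`), the dimensions, and the initialization (zero for `B`, small Gaussian for `A`, a common LoRA convention) are assumptions that do not come from the abstract.

```python
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def lora_update(B, A):
    """Weight update encoded as the product of two low-rank factors."""
    return matmul(B, A)

def apply_update(W0, delta):
    """Add the update to the frozen pre-trained weight, element by element."""
    return [[w + dw for w, dw in zip(w_row, d_row)] for w_row, d_row in zip(W0, delta)]

# Hypothetical sizes: a d x k weight matrix adapted with rank r.
d, k, r = 64, 32, 2
rng = random.Random(0)

W0 = [[rng.gauss(0, 1) for _ in range(k)] for _ in range(d)]     # frozen weights
B = [[0.0] * r for _ in range(d)]                                # d x r, assumed zero init
A = [[rng.gauss(0, 0.01) for _ in range(k)] for _ in range(r)]   # r x k, assumed Gaussian init

delta = lora_update(B, A)    # d x k matrix of rank at most r
W = apply_update(W0, delta)  # effective weight used by the adapted model

full_params = d * k          # parameters trained by full fine-tuning of this matrix
lora_params = r * (d + k)    # parameters trained by the two low-rank factors
```

Under these assumed sizes only `B` and `A` would be trained, so the trainable parameter count falls from `d * k` to `r * (d + k)`. The same factorization also caps the rank of the update at `r`, which limits what a single low-rank update can express.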
Machine Learning, Computation and Language
What problem does this paper attempt to address?