STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models

Linhai Zhang,Jialong Wu,Deyu Zhou,Guoqiang Xu

2024-06-06

Abstract:Though Large Language Models (LLMs) have demonstrated the powerful capabilities of few-shot learning through prompting methods, supervised training is still necessary for complex reasoning tasks. Because of their extensive parameters and memory consumption, both Parameter-Efficient Fine-Tuning (PEFT) methods and Memory-Efficient Fine-Tuning methods have been proposed for LLMs. Nevertheless, the issue of large annotated data consumption, the aim of Data-Efficient Fine-Tuning, remains unexplored. One obvious way is to combine the PEFT method with active learning. However, the experimental results show that such a combination is not trivial and yields inferior results. Through probe experiments, such observation might be explained by two main reasons: uncertainty gap and poor model calibration. Therefore, in this paper, we propose a novel approach to effectively integrate uncertainty-based active learning and LoRA. Specifically, for the uncertainty gap, we introduce a dynamic uncertainty measurement that combines the uncertainty of the base model and the uncertainty of the full model during the iteration of active learning. For poor model calibration, we incorporate the regularization method during LoRA training to keep the model from being over-confident, and the Monte-Carlo dropout mechanism is employed to enhance the uncertainty estimation. Experimental results show that the proposed approach outperforms existing baseline models on three complex reasoning tasks.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

This paper explores the problem of low data efficiency in fine-tuning large-scale language models (LLMs). Although LLMs have demonstrated powerful few-shot learning abilities through prompt engineering methods, complex reasoning tasks still require supervised training. Due to the large number of parameters and high memory consumption, parameter-efficient fine-tuning (PEFT) and memory-efficient fine-tuning (MEFT) methods have been proposed. However, these methods still require a large amount of annotated data, indicating that the problem of low data efficiency has not been resolved. The paper proposes a new method called STAR (dynamic constrained LoRA with active learning) to effectively combine uncertainty-based active learning and low-rank adaptation (LoRA). It is found that directly combining LoRA with active learning is not ideal due to the gap in uncertainties and poor model calibration. To address these issues, STAR introduces a dynamic uncertainty measure that combines the uncertainties of the base model and the complete model, and applies regularization methods in LoRA training to prevent overconfidence of the model. It also utilizes Monte Carlo dropout mechanism to enhance uncertainty estimation. Experimental results show that the STAR method outperforms existing baseline models in three complex reasoning tasks, indicating its effectiveness in addressing the aforementioned problem and improving data efficiency.

STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models

DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model

LoRA ensembles for large language model fine-tuning

MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models

SplitLoRA: A Split Parameter-Efficient Fine-Tuning Framework for Large Language Models

Flexora: Flexible Low Rank Adaptation for Large Language Models

BA-LoRA: Bias-Alleviating Low-Rank Adaptation to Mitigate Catastrophic Inheritance in Large Language Models

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models

KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation

HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning

Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning

Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs

Sparse Matrix in Large Language Model Fine-tuning

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models

ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models

Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization

MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning

MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning