Abstract: Fine-tuning Large Pretrained Language Models (LPLMs) has been shown to significantly improve performance on a variety of downstream tasks and to effectively control their output behavior. Recent studies have proposed numerous methods that fine-tune only a small number of parameters of open-source LPLMs, reducing the demand for computational and storage resources. Among these, reparameterization methods represented by LoRA (Low-Rank Adaptation) have gained popularity. We find that although these methods perform well in many respects, there is still considerable room for improvement in adaptability to complex tasks, performance, stability, and algorithmic complexity. Inspired by the idea that the functions of the brain are shaped by its geometric structure, this paper integrates that idea into LoRA and proposes a new matrix-transformation-based reparameterization method for efficient fine-tuning, named Matrix-Transformation based Low-Rank Adaptation (MTLoRA). MTLoRA aims to dynamically alter the spatial geometric structure of the task-specific parameter matrix by applying a transformation matrix T that performs linear transformations such as rotation, scaling, and translation. This generates new matrix feature patterns (eigenvectors) that mimic the fundamental influence of complex geometric feature patterns in the brain on its functions, thereby improving the model's performance on downstream tasks. On Natural Language Understanding (NLU) tasks, evaluated with the GLUE benchmark, MTLoRA achieves an overall performance increase of about 1.0% across eight tasks; on Natural Language Generation (NLG) tasks, it improves performance by an average of 0.95% and 0.56% on DART and WebNLG, respectively.
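
The core idea can be illustrated with a minimal sketch. The abstract does not say where T is placed or how it is parameterized, so the code below assumes one plausible form: an r x r matrix T inserted between the two LoRA factors, giving an update of B T A instead of B A. The function names, shapes, and the example values of T are hypothetical and are not taken from the paper.

```python
import random


def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]


def identity(n):
    """Return the n x n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def lora_delta(B, A):
    """Standard LoRA weight update: delta_W = B A, with B (d x r), A (r x k)."""
    return matmul(B, A)


def mtlora_delta(B, T, A):
    """ASSUMED MTLoRA-style update: delta_W = B T A, with T (r x r).

    The placement of T between B and A is an assumption for illustration.
    """
    return matmul(matmul(B, T), A)


random.seed(0)
d, k, r = 6, 5, 2  # hypothetical layer sizes and rank
B = [[random.gauss(0, 0.1) for _ in range(r)] for _ in range(d)]
A = [[random.gauss(0, 0.1) for _ in range(k)] for _ in range(r)]

# With T = I the assumed update reduces to plain LoRA.
T_identity = identity(r)
# Example T: a 90-degree rotation combined with scaling by 2 in the
# rank-r subspace (illustrative values only).
T_rot_scale = [[0.0, -2.0], [2.0, 0.0]]

delta_plain = lora_delta(B, A)
delta_identity = mtlora_delta(B, T_identity, A)
delta_transformed = mtlora_delta(B, T_rot_scale, A)
```

Under this assumption T adds only r x r parameters per adapted layer, and the rank of the update stays at most r. Translation cannot be expressed by a matrix product alone, so it is left out of the sketch.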

ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers

LoRA$^2$: Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models

ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation

SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules

LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters

CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models

LoRA-SP: Streamlined Partial Parameter Adaptation for Resource-Efficient Fine-Tuning of Large Language Models

Sparse Low-rank Adaptation of Pre-trained Language Models

LoRA-Mini: Adaptation Matrices Decomposition and Selective Training

SwitchLoRA: Switched Low-Rank Adaptation Can Learn Full-Rank Information

ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models

LoRA+: Efficient Low Rank Adaptation of Large Models

Matrix-Transformation Based Low-Rank Adaptation (MTLoRA): A Brain-Inspired Method for Parameter-Efficient Fine-Tuning

ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws

Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

ResLoRA: Identity Residual Mapping in Low-Rank Adaption

Delta-LoRA: Fine-Tuning High-Rank Parameters with the Delta of Low-Rank Matrices

MultiLoRA: Democratizing LoRA for Better Multi-Task Learning

LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning

Enhancing Parameter Efficiency and Generalization in Large-Scale Models: A Regularized and Masked Low-Rank Adaptation Approach