Abstract:Large models represent a groundbreaking advancement in multiple application fields, enabling remarkable achievements across various tasks. However, their unprecedented scale comes with significant computational costs. These models, often consisting of billions of parameters, require vast amounts of computational resources for execution. Especially, the expansive scale and computational demands pose considerable challenges when customizing them for particular downstream tasks, particularly over the hardware platforms constrained by computational capabilities. Parameter Efficient Fine-Tuning (PEFT) provides a practical solution by efficiently adapt the large models over the various downstream tasks. In particular, PEFT refers to the process of adjusting the parameters of a pre-trained large models to adapt it to a specific task while minimizing the number of additional parameters introduced or computational resources required. This approach is particularly important when dealing with large language models with high parameter counts, as fine-tuning these models from scratch can be computationally expensive and resource-intensive, posing considerable challenges in the supporting system platform design. In this survey, we present comprehensive studies of various PEFT algorithms, examining their performance and computational overhead. Moreover, we provide an overview of applications developed using different PEFT algorithms and discuss common techniques employed to mitigate computation costs for PEFT. In addition to the algorithmic perspective, we overview various real-world system designs to investigate the implementation costs associated with different PEFT algorithms. This survey serves as an indispensable resource for researchers aiming to understand both the PEFT algorithm and its system implementation, offering detailed insights into recent advancements and practical applications.

Parameter-efficient fine-tuning of large-scale pre-trained language models

Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment

Arbitrary Few Parameters Are Good Enough for Adapting Large-scale Pre-trained Language Models

Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning

Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks.

Parameter-efficient Tuning for Large Language Model Without Calculating Its Gradients

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models

An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models

Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively

When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling

Learning Global Controller in Latent Space for Parameter-Efficient Fine-Tuning

Towards Better Parameter-Efficient Fine-Tuning for Large Language Models: A Position Paper

Towards a Unified View of Parameter-Efficient Transfer Learning

Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications

Parameter-Efficient Fine-Tuning With Adapters

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning

Empirical Analysis of Efficient Fine-Tuning Methods for Large Pre-Trained Language Models

Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization