Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Luping Wang,Sheng Chen,Linnan Jiang,Shu Pan,Runze Cai,Sen Yang,Fei Yang
2024-11-01
Abstract:The large models, as predicted by scaling raw forecasts, have made groundbreaking progress in many fields, particularly in natural language generation tasks, where they have approached or even surpassed human levels. However, the unprecedented scale of their parameters brings significant computational and storage costs. These large models require substantial computational resources and GPU memory to operate. When adapting large models to specific downstream tasks, their massive parameter scale poses a significant challenge in fine-tuning on hardware platforms with limited computational power and GPU memory. To address this issue, Parameter-Efficient Fine-Tuning (PEFT) offers a practical solution by efficiently adjusting the parameters of large pre-trained models to suit various downstream tasks. Specifically, PEFT adjusts the parameters of pre-trained large models to adapt to specific tasks or domains, minimizing the introduction of additional parameters and the computational resources required. This review mainly introduces the preliminary knowledge of PEFT, the core ideas and principles of various PEFT algorithms, the applications of PEFT, and potential future research directions. By reading this review, we believe that interested parties can quickly grasp the PEFT methodology, thereby accelerating its development and innovation.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the methods and challenges of Parameter - Efficient Fine - Tuning (PEFT) in large - scale pre - trained models (such as large language models). With the continuous expansion of the model parameter scale, these large models have made breakthrough progress in tasks such as natural language generation, and even reached or exceeded the human level. However, the huge number of parameters also brings significant computing and storage costs, which makes it very difficult to fine - tune these models for specific downstream tasks on hardware platforms with limited computing resources. To solve this problem, PEFT provides an effective solution. It efficiently adjusts the parameters of the pre - trained model to adapt to different downstream tasks while minimizing the need to introduce additional parameters and required computing resources. The paper mainly introduces the basic knowledge of PEFT, the core ideas and principles of various PEFT algorithms, the applications of PEFT, and future research directions. By reading this review, researchers can quickly master the methodology of PEFT, thus accelerating its development and innovation.