Abstract: Pre-trained code models (e.g., CodeBERT and CodeT5) have demonstrated their code intelligence in various software engineering tasks, such as code summarization, and full fine-tuning has become the typical approach to adapting these models to downstream tasks. However, fully fine-tuning these large models can be computationally expensive and memory-intensive, particularly when training for multiple tasks. To alleviate this issue, several parameter-efficient fine-tuning methods (e.g., Adapter and LoRA) have been proposed that train only a small number of additional parameters while keeping the original pre-trained parameters frozen. Although these methods claim superiority over prior techniques, they have seldom been compared comprehensively and fairly across multiple software engineering tasks. Moreover, beyond their potential to reduce fine-tuning costs while maintaining approximately the same performance, the effectiveness of these methods in low-resource, cross-language, and cross-project scenarios has not been adequately studied. To this end, we first conduct experiments by fine-tuning state-of-the-art code models with these methods on both code understanding and code generation tasks. The results show that, by tuning only 0.5% additional parameters, these methods may achieve comparable or higher performance than full fine-tuning on code understanding tasks, but may exhibit slightly weaker performance on code generation tasks. We also investigate the impact of these methods with varying numbers of training samples and find that a considerable number of samples (e.g., 1000 for clone detection) may be required for them to approximate the performance of full fine-tuning. Our experimental results in cross-language and cross-project scenarios demonstrate that, by freezing most pre-trained parameters and tuning only 0.5% additional parameters, these methods consistently improve the models' transfer learning ability compared with full fine-tuning.
Our code and data are available at https://github.com/anonymous-ase23/CodeModelParameterEfficientFinetuning.
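
To make the idea of "training a small number of additional parameters while keeping the pre-trained parameters frozen" concrete, the following is a minimal, dependency-free sketch of a LoRA-style linear layer. It is an illustration only, not the implementation used in the study: the class name `LoRALinear`, its arguments (`rank`, `alpha`), and the initialization values are assumptions chosen for readability.

```python
import random


class LoRALinear:
    """Hypothetical sketch: frozen weight W plus a trainable low-rank update B @ A."""

    def __init__(self, d_in, d_out, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        # Frozen "pre-trained" weight (d_out x d_in); never updated in fine-tuning.
        self.W = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Trainable low-rank factors: A is (rank x d_in), B is (d_out x rank).
        # B starts at zero, so the layer initially matches the frozen model.
        self.A = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]
        self.scale = alpha / rank

    def forward(self, x):
        # Output of the frozen path: W x
        base = [sum(w * xi for w, xi in zip(row, x)) for row in self.W]
        # Output of the low-rank path: B (A x)
        down = [sum(a * xi for a, xi in zip(row, x)) for row in self.A]
        up = [sum(b * d for b, d in zip(row, down)) for row in self.B]
        return [y + self.scale * u for y, u in zip(base, up)]

    def trainable_fraction(self):
        # Additional (trainable) parameters relative to the frozen ones.
        frozen = len(self.W) * len(self.W[0])
        trainable = len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])
        return trainable / frozen
```

For a single 768 x 768 layer with `rank=2`, `trainable_fraction()` evaluates to 3072 / 589824 = 1/192, or roughly 0.52%. These dimensions are chosen purely for illustration and are not the configuration reported in the paper; the ratio for a whole model depends on which layers receive the low-rank update.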

An Empirical Study of Parameter-Efficient Fine-Tuning Methods for Pre-Trained Code Models.
