Abstract:Pre-trained models have been shown effective in many code intelligence tasks, such as automatic code summarization and defect prediction. These models are pre-trained on large-scale unlabeled corpus and then fine-tuned in downstream tasks. However, as the inputs to pre-training and downstream tasks are in different forms, it is hard to fully explore the knowledge of pre-trained models. Besides, the performance of fine-tuning strongly relies on the amount of downstream task data, while in practice, the data scarcity scenarios are common. Recent studies in the natural language processing (NLP) field show that prompt tuning, a new paradigm for tuning, alleviates the above issues and achieves promising results in various NLP tasks. In prompt tuning, the prompts inserted during tuning provide task-specific knowledge, which is especially beneficial for tasks with relatively scarce data. In this article, we empirically evaluate the usage and effect of prompt tuning in code intelligence tasks. We conduct prompt tuning on popular pre-trained models CodeBERT and CodeT5 and experiment with four code intelligence tasks including defect prediction, code search, code summarization, and code translation. Our experimental results show that prompt tuning consistently outperforms fine-tuning in all four tasks. In addition, prompt tuning shows great potential in low-resource scenarios, e.g., improving the BLEU scores of fine-tuning by more than 26% on average for code summarization. Our results suggest that instead of fine-tuning, we could adapt prompt tuning for code intelligence tasks to achieve better performance, especially when lacking task-specific data. We also discuss the implications for adapting prompt tuning in code intelligence tasks.

Prompt Tuning in Code Intelligence: An Experimental Evaluation

No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code Intelligence

P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks

P-Tuning: Prompt Tuning Can Be Comparable to Fine-tuning Across Scales and Tasks

APrompt: Attention Prompt Tuning for Efficient Adaptation of Pre-trained Language Models

Dynamic Prompting: A Unified Framework for Prompt Tuning

ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning

Prompt Tuning for Generative Multimodal Pretrained Models

Pro-tuning: Unified Prompt Tuning for Vision Tasks

LIPT: Improving Prompt Tuning with Late Inception Reparameterization

Making Pre-trained Language Models End-to-end Few-shot Learners with Contrastive Prompt Tuning

Instance-wise Prompt Tuning for Pretrained Language Models

Subgraph-level Universal Prompt Tuning

Context-Focused Prompt Tuning Pre-Trained Code Models to Improve Code Summarization

Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts

When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning

Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion

Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models

Exploring the Curious Case of Code Prompts