Abstract: Recent studies have shown that prompt-based fine-tuning improves the performance of large Pre-trained Language Models (PLMs) on few-shot text classification. Specifically, this type of method recasts text classification as the Masked Language Modeling (MLM) task used in pre-training, with natural language prompts and verbalizers bridging the gap between the pre-training and fine-tuning stages. Yet it is unclear how prompting knowledge can be transferred across similar or distant NLP tasks for mutual reinforcement. Based on continuous prompt embeddings, we propose TransPrompt v2, a novel transferable prompting framework for few-shot learning across similar or distant text classification tasks. For transferable prompt learning across similar tasks, we employ a multi-task meta-knowledge acquisition procedure to train a meta-learner that captures cross-task transferable knowledge. For learning across distant tasks, we further introduce task type descriptions and propose intra-type and inter-type prompt encoders to capture implicit mutual relations among multiple distant tasks. In addition, we design two de-biasing techniques that make the trained meta-learner more task-agnostic and unbiased towards any single task. The meta-learner can then be fine-tuned for specific tasks from a better parameter initialization. Extensive experiments show that TransPrompt v2 outperforms strong single-task and cross-task baselines over multiple NLP tasks and datasets. We further show that the meta-learner can effectively improve the performance of PLMs on previously unseen tasks. TransPrompt v2 also outperforms strong fine-tuning baselines when learning with full training sets.
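As a rough illustration of the prompt-and-verbalizer idea mentioned in the abstract, the sketch below shows how a classification input can be wrapped in a template containing a [MASK] slot, and how the MLM scores of a few label words can be turned into class probabilities. It is not the TransPrompt v2 method, which uses continuous prompt embeddings and a meta-learner. The template, the label words, and the toy logits are all assumptions made for this example, and no real PLM is called.

```python
import math

# Hypothetical template and verbalizer; neither is taken from the paper.
TEMPLATE = "{text} It was [MASK]."
VERBALIZER = {"positive": "great", "negative": "terrible"}


def build_prompt(text):
    """Wrap the input so that classification becomes filling in [MASK]."""
    return TEMPLATE.format(text=text)


def class_probabilities(mask_logits, verbalizer):
    """Softmax restricted to the MLM logits of the label words.

    mask_logits maps vocabulary tokens to the (toy) logits a masked
    language model would assign at the [MASK] position.
    """
    scores = {label: mask_logits[word] for label, word in verbalizer.items()}
    peak = max(scores.values())  # subtract the max for numerical stability
    exps = {label: math.exp(s - peak) for label, s in scores.items()}
    total = sum(exps.values())
    return {label: e / total for label, e in exps.items()}


# Toy logits standing in for a real PLM's output at [MASK] (assumed values).
toy_logits = {"great": 2.0, "terrible": 0.5, "the": 1.0}
prompt = build_prompt("A moving and well-acted film.")
probs = class_probabilities(toy_logits, VERBALIZER)
prediction = max(probs, key=probs.get)
```

In an actual prompt-based setup, the toy logits would be replaced by the PLM's output at the mask position, and fine-tuning would minimize the cross-entropy of these class probabilities against the gold labels.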

Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

What Makes Pre-trained Language Models Better Zero/Few-shot Learners?

Helping Language Models Learn More: Multi-dimensional Task Prompt for Few-shot Tuning

AdaPrompt: Adaptive Model Training for Prompt-based NLP

Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer

Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning

Unified Prompt Learning Makes Pre-Trained Language Models Better Few-Shot Learners

Prompt2Model: Generating Deployable Models from Natural Language Instructions

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems

Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer

TransPrompt V2: Transferable Prompt-based Fine-tuning for Few-shot Text Classification

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Leveraging Zero-Shot Prompting for Efficient Language Model Distillation

Exploring Lottery Prompts for Pre-trained Language Models

PPT: Pre-trained Prompt Tuning for Few-shot Learning

LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning

BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction

Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance

Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning

Bidirectional Language Models Are Also Few-shot Learners