PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision Transformer

Qian Feng,Hanbin Zhao,Chao Zhang,Jiahua Dong,Henghui Ding,Yu-Gang Jiang,Hui Qian
2024-07-04
Abstract:Incremental Learning (IL) aims to learn deep models on sequential tasks continually, where each new task includes a batch of new classes and deep models have no access to task-ID information at the inference time. Recent vast pre-trained models (PTMs) have achieved outstanding performance by prompt technique in practical IL without the old samples (rehearsal-free) and with a memory constraint (memory-constrained): Prompt-extending and Prompt-fixed methods. However, prompt-extending methods need a large memory buffer to maintain an ever-expanding prompt pool and meet an extra challenging prompt selection problem. Prompt-fixed methods only learn a single set of prompts on one of the incremental tasks and can not handle all the incremental tasks effectively. To achieve a good balance between the memory cost and the performance on all the tasks, we propose a Parameter-Efficient Cross-Task Prompt (PECTP) framework with Prompt Retention Module (PRM) and classifier Head Retention Module (HRM). To make the final learned prompts effective on all incremental tasks, PRM constrains the evolution of cross-task prompts' parameters from Outer Prompt Granularity and Inner Prompt Granularity. Besides, we employ HRM to inherit old knowledge in the previously learned classifier heads to facilitate the cross-task prompts' generalization ability. Extensive experiments show the effectiveness of our method. The source codes will be available at \url{<a class="link-external link-https" href="https://github.com/RAIAN08/PECTP" rel="external noopener nofollow">this https URL</a>}.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively utilize pre - trained models (PTMs) for parameter - efficient cross - task prompt learning in the rehearsal - free and memory - constrained incremental learning (IL) scenarios. Specifically, the paper focuses on how to enable deep models to continuously learn from new tasks, maintain good performance on previously learned tasks, and avoid catastrophic forgetting without accessing old samples and with limited memory resources. ### Main Problems 1. **Catastrophic Forgetting**: During the incremental learning process, the model may forget previously learned tasks when learning new tasks. 2. **Memory Limitations**: In practical applications, due to privacy and memory limitations, it is impossible to store samples of old tasks. 3. **Prompt Selection Problem**: Existing prompt expansion methods need to continuously expand the prompt pool and select an appropriate set of prompts during inference, which will lead to increased memory overhead and additional computational costs. 4. **Limitations of a Single Prompt Set**: Although existing fixed - prompt methods save memory, a single task - specific prompt set is difficult to guide the model to perform well in subsequent incremental tasks. ### Solutions To solve the above problems, the paper proposes a parameter - efficient cross - task prompt framework (PECTP), which mainly consists of the following two modules: 1. **Prompt Retention Module (PRM)**: - **Outer Prompt Granularity (OPG)**: By constraining the feature output of prompt generation, the model will not forget the knowledge of old tasks when learning new tasks. Specific implementations include point - to - point, block - to - block, patch - to - patch, and dimension - to - dimension feature distribution constraints. - **Inner Prompt Granularity (IPG)**: Directly constrain the changes of prompt parameters to ensure that the learning of new tasks will not destroy the knowledge of old tasks. 2. **Head Retention Module (HRM)**: - By inheriting the knowledge of the classification head already learned in old tasks, further enhance the generalization ability of cross - task prompts. ### Experimental Results The paper verifies the effectiveness of PECTP through experiments on multiple benchmark datasets. The experimental results show that PECTP is superior to existing incremental learning methods in both performance and memory cost. ### Summary The paper proposes a new incremental learning framework PECTP. Through parameter - efficient cross - task prompts and prompt retention modules, it effectively solves the incremental learning problems in the rehearsal - free and memory - constrained scenarios. This method not only performs well in performance but also is very efficient in memory usage.