DePT: Decoupled Prompt Tuning

Ji Zhang,Shihan Wu,Lianli Gao,Heng Tao Shen,Jingkuan Song
DOI: https://doi.org/10.1109/cvpr52733.2024.01228
2024-01-01
Computer Vision and Pattern Recognition
Abstract:This work breaks through the Base-New Tradeoff (BNT)dilemma in prompt tuning,i.e., the better the tuned model generalizes to the base (or target) task, theworse it generalizes to new tasks, and vice versa. Specifically, through anin-depth analysis of the learned features of the base and new tasks, we observethat the BNT stems from a channel bias issue, i.e., the vast majority offeature channels are occupied by base-specific knowledge, resulting in thecollapse of taskshared knowledge important to new tasks. To address this, wepropose the Decoupled Prompt Tuning (DePT) framework, which decouplesbase-specific knowledge from feature channels into an isolated feature spaceduring prompt tuning, so as to maximally preserve task-shared knowledge in theoriginal feature space for achieving better zero-shot generalization on newtasks. Importantly, our DePT is orthogonal to existing prompt tuning methods,hence it can improve all of them. Extensive experiments on 11 datasets show thestrong flexibility and effectiveness of DePT. Our code and pretrained modelsare available at https://github.com/Koorye/DePT.
What problem does this paper attempt to address?