UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao,Bo Wan,Ying Zhang,Xu Jia,Huchuan Lu,Long Chen
DOI: https://doi.org/10.1109/cvpr52733.2024.02714
2024-01-01
Computer Vision and Pattern Recognition
Abstract:Parameter-efficient transfer learning (PETL), i.e., fine-tuning a smallportion of parameters, is an effective strategy for adapting pre-trained modelsto downstream domains. To further reduce the memory demand, recent PETL worksfocus on the more valuable memory-efficient characteristic. In this paper, weargue that the scalability, adaptability, and generalizability ofstate-of-the-art methods are hindered by structural dependency and pertinencyon specific pre-trained backbones. To this end, we propose a newmemory-efficient PETL strategy, Universal Parallel Tuning (UniPT), to mitigatethese weaknesses. Specifically, we facilitate the transfer process via alightweight and learnable parallel network, which consists of: 1) A parallelinteraction module that decouples the sequential connections and processes theintermediate activations detachedly from the pre-trained network. 2) Aconfidence aggregation module that learns optimal strategies adaptively forintegrating cross-layer features. We evaluate UniPT with different backbones(e.g., T5, VSE∞, CLIP4Clip, Clip-ViL, and MDETR) on variousvision-and-language and pure NLP tasks. Extensive ablations on 18 datasets havevalidated that UniPT can not only dramatically reduce memory consumption andoutperform the best competitor, but also achieve competitive performance overother plain PETL methods with lower training memory overhead. Our code ispublicly available at: https://github.com/Paranioar/UniPT.