Pear: Pruning and Sharing Adapters in Visual Parameter-Efficient Fine-Tuning

Yibo Zhong, Yao Zhou
2024-09-29
Abstract: Adapters have been widely explored to alleviate computational and storage costs when fine-tuning pretrained foundation models. However, the adapter itself can exhibit redundancy, leading to unnecessary storage overhead and inferior performance. In this paper, we propose Prune and Share (Pear), a novel adapter-pruning framework for efficient fine-tuning of pretrained visual foundation models. Specifically, we prune certain adapters and share the more important unpruned ones with positions where adapters are pruned, allowing continual adaptation at these positions after pruning. Additionally, a knowledge checkpoint strategy is introduced, which preserves the information of the pruned adapters and further boosts performance. Experimental results on a visual adaptation benchmark validate the effectiveness and efficiency of the proposed Pear compared to other competitive methods. Code is available at https://github.com/yibozhong/pear.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper asks how, when fine-tuning pretrained visual foundation models, pruning and sharing adapters can remove redundant parameters, reducing computational and storage costs while maintaining or improving model performance. Although existing adapters significantly reduce the number of parameters and GPU memory usage, the adapters themselves can be redundant, which leads to unnecessary storage overhead and degraded performance. To address this, the authors propose Prune and Share (Pear), which aims to achieve efficient fine-tuning by pruning unimportant adapters and sharing the important ones.

### Main problem summary:

1. **Redundant adapters**: Existing adapters are themselves redundant, resulting in unnecessary storage overhead and performance degradation.
2. **Limitations of traditional pruning methods**: Traditional structured pruning methods remove adapters outright, so the pruned positions can no longer adapt, which hurts overall performance.
3. **Information loss**: Directly pruning adapters discards the information they contain. Even if these adapters are relatively unimportant, retaining their information still helps achieve more comprehensive adaptation.

### Solutions:

- **Prune and Share (Pear)**: Unimportant adapters are pruned, and the remaining important adapters are shared with the pruned positions, so all positions can continue to adapt without introducing additional parameters.
- **Knowledge Checkpoint**: The information of the pruned adapters is preserved to further improve adaptation.

With these methods, Pear reduces the number of parameters while maintaining or even improving model performance, which is especially useful in resource-constrained settings (such as limited GPU memory or lightweight devices).
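
The prune-and-share idea described above can be illustrated with a small sketch. It is not the authors' implementation: the function name `plan_prune_and_share`, the per-position importance scores, and the round-robin sharing rule are all assumptions made for illustration, and this summary does not specify how Pear measures importance or chooses which adapter a pruned position reuses. The sketch only shows that every position keeps an adapter to adapt with, while fewer distinct adapters need to be stored.

```python
from typing import Dict, List


def plan_prune_and_share(importance: List[float], num_pruned: int) -> Dict[int, int]:
    """Map every adapter position to the index of the adapter it will use.

    Hypothetical helper: the scoring and the sharing rule are illustrative
    assumptions, not the method from the paper.
    """
    if not 0 <= num_pruned < len(importance):
        raise ValueError("num_pruned must leave at least one adapter")

    # Rank positions from most to least important; ties keep the earlier position first.
    ranked = sorted(range(len(importance)), key=lambda i: -importance[i])
    num_kept = len(importance) - num_pruned
    kept, pruned = ranked[:num_kept], ranked[num_kept:]

    # Kept positions continue to use their own adapter.
    assignment = {pos: pos for pos in kept}

    # Assumed rule: pruned positions borrow kept adapters round-robin,
    # starting with the most important kept adapter.
    for n, pos in enumerate(pruned):
        assignment[pos] = kept[n % len(kept)]
    return assignment


# Four adapter positions with made-up importance scores; prune the two weakest.
scores = [0.9, 0.1, 0.5, 0.2]
plan = plan_prune_and_share(scores, num_pruned=2)

# Only the distinct adapters referenced by the plan need to be stored.
stored = sorted(set(plan.values()))
print(plan)
print(stored)
```

In this toy run, positions 1 and 3 lose their own adapters but still map to a stored adapter, so all four positions can keep adapting while only two adapters are kept. The knowledge checkpoint is left out because this summary does not describe how the preserved information is reused.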