A unified pruning framework for vision transformers

Hao Yu,Jianxin Wu
DOI: https://doi.org/10.1007/s11432-022-3646-6
2023-04-14
Science China Information Sciences
Abstract:Conclusion In this study, we proposed a novel method called UP-ViTs to prune ViTs in a unified manner. Our framework can prune all components in a ViT and its variants, maintain the models' structure, and generalize well into downstream tasks. UP-ViTs achieve state-of-the-art results when pruning various ViT backbones. Moreover, we studied the transferring ability of the compressed model and found that our UP-ViTs also outperform original ViTs. We also extended our method into NLP tasks and obtained more efficient transformer models. Please refer to the appendix for more details.
computer science, information systems,engineering, electrical & electronic
What problem does this paper attempt to address?