Spatial-Temporal Spiking Feature Pruning in Spiking Transformer

Zhaokun Zhou,Kaiwei Che,Jun Niu,Man Yao,Guoqi Li,Li Yuan,Guibo Luo,Yuesheng Zhu
DOI: https://doi.org/10.1109/tcds.2024.3500018
IF: 4.546
2024-01-01
IEEE Transactions on Cognitive and Developmental Systems
Abstract:Spiking Neural Networks (SNNs) are known for brain-inspired architecture and low power consumption. Leveraging bio-compatibility and self-attention mechanism, Spiking Transformers become the most promising SNN architecture with high accuracy. However, Spiking Transformers still face the challenge of high training costs, such as a 51 M network requiring 181 training hours on ImageNet. In this work, we explore feature pruning to reduce training costs and overcome two challenges: high pruning ratio and lightweight pruning methods. We first analyze the spiking features and find the potential for a high pruning ratio. The majority of information is concentrated on a part of the spiking features in Spiking Transformer, which suggests that we can keep this part of the tokens and prune the others. To achieve lightweight, a parameter-free Spatial-Temporal Spiking Feature Pruning method is proposed, which uses only a simple addition-sorting operation. The spiking features/tokens with high spike accumulation values are selected for training. The others are pruned and merged through a compensation module called Softmatch. Experimental results demonstrate that our method reduces training costs without compromising image classification accuracy. On ImageNet, our approach reduces the training time from 181 hours to 128 hours while achieving comparable accuracy (83.13% vs. 83.07%).
What problem does this paper attempt to address?