Abstract:Although spiking neural networks (SNNs) have made great progress on both performance and efficiency over the last few years, their unique working pattern makes it hard to train high-performance low-latency SNNs and their development still lags behind traditional artificial neural networks (ANNs). To compensate this gap, many extraordinary works have been proposed, but these works are mainly based on the same network structure (i.e. CNN) and their performance is worse than their ANN counterparts, which limits the applications of SNNs. To this end, we propose a Transformer-based SNN, termed ”Spikeformer”, which outperforms its ANN counterpart on both static dataset and neuromorphic datasets. First, to deal with the problem of “data hungry” and the unstable training period exhibited in the vanilla model, we design the Convolutional Tokenizer (CT) module, which stabilizes training and improves the accuracy of the original model on DVS-Gesture by more than 16%. Besides, we integrate Spatio-Temporal Attention (STA) into Spikeformer to better incorporate the attention mechanism inside Transformer and the spatio-temporal information inherent to SNN. With our proposed method, we achieve 98.96%/75.89% top-1 accuracy on DVS-Gesture/ImageNet datasets with 16/4 simulation time steps. On DVS-CIFAR10, we further conduct energy consumption analysis and obtain 81.4%/80.3% top-1 accuracy with 4/1 time step(s), achieving 1.7/6.4 × energy efficiency over its ANN counterpart. Moreover, our Spikeformer outperforms its ANN counterpart by 3.13% and 0.12% on DVS-Gesture and ImageNet respectively, indicating that Spikeformer may be a more suitable architecture for training SNNs compared to CNN. We believe that this work shall promote the development of SNNs to be in step with ANNs as much as possible. Code will be publicly available.

Towards High-performance Spiking Transformers from ANN to SNN Conversion

Spike Trains Encoding and Threshold Rescaling Method for Deep Spiking Neural Networks

Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training

SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN

Spikeformer: Training high-performance spiking neural network with transformer

Masked Spiking Transformer

Optimizing Power Efficiency: Converting Recurrent Neural Networks to Spiking Neural Networks for Time-Domain Analysis

Optimal ANN-SNN Conversion for Fast and Accurate Inference in Deep Spiking Neural Networks

Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks

LDD: High-Precision Training of Deep Spiking Neural Network Transformers Guided by an Artificial Neural Network

Toward High-Accuracy and Low-Latency Spiking Neural Networks With Two-Stage Optimization

Spatio-Temporal Approximation: A Training-Free SNN Conversion for Transformers

SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks

Optimal Conversion of Conventional Artificial Neural Networks to Spiking Neural Networks

SpikingMiniLM: Energy-Efficient Spiking Transformer for Natural Language Understanding

Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications

Converting High-Performance and Low-Latency SNNs through Explicit Modelling of Residual Error in ANNs

Optimized Potential Initialization for Low-latency Spiking Neural Networks

Spike Calibration: Bridging the Gap between ANNs and SNNs in ANN-SNN Conversion

Bridging the Gap between ANNs and SNNs by Calibrating Offset Spikes

Spiking Deep Residual Networks.