Abstract:Although spiking neural networks (SNNs) have made great progress on both performance and efficiency over the last few years, their unique working pattern makes it hard to train high-performance low-latency SNNs and their development still lags behind traditional artificial neural networks (ANNs). To compensate this gap, many extraordinary works have been proposed, but these works are mainly based on the same network structure (i.e. CNN) and their performance is worse than their ANN counterparts, which limits the applications of SNNs. To this end, we propose a Transformer-based SNN, termed ”Spikeformer”, which outperforms its ANN counterpart on both static dataset and neuromorphic datasets. First, to deal with the problem of “data hungry” and the unstable training period exhibited in the vanilla model, we design the Convolutional Tokenizer (CT) module, which stabilizes training and improves the accuracy of the original model on DVS-Gesture by more than 16%. Besides, we integrate Spatio-Temporal Attention (STA) into Spikeformer to better incorporate the attention mechanism inside Transformer and the spatio-temporal information inherent to SNN. With our proposed method, we achieve 98.96%/75.89% top-1 accuracy on DVS-Gesture/ImageNet datasets with 16/4 simulation time steps. On DVS-CIFAR10, we further conduct energy consumption analysis and obtain 81.4%/80.3% top-1 accuracy with 4/1 time step(s), achieving 1.7/6.4 × energy efficiency over its ANN counterpart. Moreover, our Spikeformer outperforms its ANN counterpart by 3.13% and 0.12% on DVS-Gesture and ImageNet respectively, indicating that Spikeformer may be a more suitable architecture for training SNNs compared to CNN. We believe that this work shall promote the development of SNNs to be in step with ANNs as much as possible. Code will be publicly available.

SpikingMiniLM: Energy-Efficient Spiking Transformer for Natural Language Understanding

Towards Energy-Preserving Natural Language Understanding with Spiking Neural Networks

BitSNNs: Revisiting Energy-efficient Spiking Neural Networks

Efficient Structure Slimming for Spiking Neural Networks

Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training

Towards High-performance Spiking Transformers from ANN to SNN Conversion

Spike Trains Encoding and Threshold Rescaling Method for Deep Spiking Neural Networks

Spikeformer: Training high-performance spiking neural network with transformer

SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Masked Spiking Transformer

BrainTransformers: SNN-LLM

Optimizing Power Efficiency: Converting Recurrent Neural Networks to Spiking Neural Networks for Time-Domain Analysis

Spiking Deep Residual Networks.

Training a General Spiking Neural Network with Improved Efficiency and Minimum Latency

SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

SNNLP: Energy-Efficient Natural Language Processing Using Spiking Neural Networks

SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks

Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips

SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation

SNN-BERT: Training-efficient Spiking Neural Networks for energy-efficient BERT