Abstract: The proliferation of Artificial Neural Networks (ANNs) has led to increased energy consumption, raising concerns about their sustainability. Spiking Neural Networks (SNNs), which are inspired by biological neural systems and operate using sparse, event-driven spikes to communicate information between neurons, offer a potential solution due to their lower energy requirements. An alternative technique for reducing a neural network's footprint is quantization, which compresses weight representations to decrease memory usage and energy consumption. In this study, we present Twin Network Augmentation (TNA), a novel training framework aimed at improving the performance of SNNs while also facilitating enhanced compression through low-precision quantization of weights. TNA involves co-training an SNN with a twin network, optimizing both networks to minimize their cross-entropy losses and the mean squared error between their output logits. We demonstrate that TNA significantly enhances classification performance across various vision datasets and is particularly effective when reducing SNNs to ternary weight precision. Notably, during inference, only the ternary SNN is retained, significantly reducing the network's number of neurons, its connectivity, and the size of its weight representation. Our results show that TNA outperforms traditional knowledge distillation methods and achieves state-of-the-art performance for the evaluated network architecture on benchmark datasets, including CIFAR-10, CIFAR-100, and CIFAR-10-DVS. This paper underscores the effectiveness of TNA in bridging the performance gap between SNNs and ANNs and suggests further exploration into the application of TNA in different network architectures and datasets.
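
The training objective described in the abstract can be written compactly. The following is one plausible formulation rather than the paper's exact notation: $z_b$ and $z_t$ denote the output logits of the base and twin networks, $y$ the label, $C$ the number of classes, and $\lambda$ is a hypothetical weighting factor that the abstract does not specify.

$$
\mathcal{L}_{\text{TNA}} = \mathcal{L}_{\text{CE}}(z_b, y) + \mathcal{L}_{\text{CE}}(z_t, y) + \lambda \, \frac{1}{C} \lVert z_b - z_t \rVert_2^2
$$

Under this reading, the first two terms train each network on the classification task, while the third term pulls their logits together. Only the base network would be kept for inference.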

What problem does this paper attempt to address?

The main problems this paper attempts to address are:

1. **Improving the performance of Spiking Neural Networks (SNNs)**: Although SNNs have significant advantages in energy efficiency over traditional Artificial Neural Networks (ANNs), their performance on various benchmark tasks is still inferior to ANNs. The paper proposes a new training method, Twin Network Augmentation (TNA), which aims to enhance the classification performance of SNNs by co-training a twin network with the same structure as the base SNN.
2. **Achieving efficient weight quantization**: The paper also explores how to maintain or even improve the performance of SNNs when compressed to low-precision weights (e.g., ternary weights). Through the TNA method, high classification accuracy can be maintained even after the network is compressed.

Specifically, the paper addresses these problems through the following approaches:

- **Introducing the TNA method**: During the training phase, a base SNN and a randomly initialized twin SNN are trained simultaneously. The optimization objectives include minimizing the cross-entropy loss of both networks and the mean squared error between their output logits (logit matching loss). This helps to enhance the model's generalization and representation capabilities.
- **Application to ternary weight SNNs**: During training, full-precision weights are used initially, and after a certain number of training epochs, the network is compressed to ternary weights and training continues to fine-tune the model. Experimental results show that this method significantly improves classification performance on multiple datasets, particularly on more challenging datasets such as CIFAR-100 and CIFAR-10-DVS.

In summary, by proposing the TNA method, the paper not only improves the classification performance of SNNs but also makes significant progress in weight quantization, providing a new solution for the application of low-power neuromorphic hardware.
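
The two ingredients described above, the combined loss and the compression to ternary weights, can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the `match_weight` balancing factor, and the magnitude-threshold rule in `ternarize` (a ratio times the mean absolute weight) are all hypothetical choices that the summary does not specify. The logits here are plain arrays standing in for the outputs of the base and twin SNNs.

```python
import numpy as np


def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy of integer class labels given raw logits."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


def tna_loss(base_logits, twin_logits, labels, match_weight=1.0):
    """Cross-entropy of both networks plus the MSE between their logits.

    match_weight is an assumed balancing factor, not a value from the paper.
    """
    ce_base = softmax_cross_entropy(base_logits, labels)
    ce_twin = softmax_cross_entropy(twin_logits, labels)
    logit_match = np.mean((base_logits - twin_logits) ** 2)
    return ce_base + ce_twin + match_weight * logit_match


def ternarize(weights, threshold_ratio=0.7):
    """Map weights to {-1, 0, +1} using a magnitude threshold.

    The threshold rule (ratio times mean |w|) is an assumed heuristic.
    """
    threshold = threshold_ratio * np.abs(weights).mean()
    return np.where(np.abs(weights) > threshold, np.sign(weights), 0.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    labels = np.array([0, 2, 1, 2])
    base_logits = rng.normal(size=(4, 3))  # stand-in for base SNN outputs
    twin_logits = rng.normal(size=(4, 3))  # stand-in for twin SNN outputs
    print("TNA loss:", tna_loss(base_logits, twin_logits, labels))
    print("Ternary weights:", ternarize(np.array([0.9, -0.05, -0.6, 0.1])))
```

In a training loop following the schedule described above, `tna_loss` would be minimized with full-precision weights first, and a step like `ternarize` would be applied to the base network's weights after the chosen number of epochs, with fine-tuning continuing afterwards.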

Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization
