Abstract:Spiking neural networks (SNNs) are increasingly applied to deep architectures. Recent works are developed to apply spatio-temporal backpropagation to directly train deep SNNs. But the binary and non-differentiable properties of spike activities force directly trained SNNs to suffer from serious gradient vanishing. In this paper, we first analyze the cause of the gradient vanishing problem and identify that the gradients mostly backpropagate along the synaptic currents. Based on that, we modify the synaptic current equation of leaky-integrate-fire neuron model and propose the improved LIF (IM-LIF) neuron model on the basis of the temporal-wise attention mechanism. We utilize the temporal-wise attention mechanism to selectively establish the connection between the current and historical response values, which can empirically enable the neuronal states to update resilient to the gradient vanishing problem. Furthermore, to capture the neuronal dynamics embedded in the output incorporating the IM-LIF model, we present a new temporal loss function to constrain the output of the network close to the target distribution. The proposed new temporal loss function could not only act as a regularizer to eliminate output outliers, but also assign the network loss credit to the voltage at a specific time point. Then we modify the ResNet and VGG architecture based on the IM-LIF model to build deep SNNs. We evaluate our work on image datasets and neuromorphic datasets. Experimental results and analysis show that our method can help build deep SNNs with competitive performance in both accuracy and latency, including 95.66% on CIFAR-10, 77.42% on CIFAR-100, 55.37% on Tiny-ImageNet, 97.33% on DVS-Gesture, and 80.50% on CIFAR-DVS with very few timesteps.

Shuttlenet: A Biologically-Inspired RNN with Loop Connection and Parameter Sharing.

Learning long-term dependencies for action recognition with a biologically-inspired deep network

Residual Recurrent Neural Networks for Learning Sequential Representations.

BiO-Net: Learning Recurrent Bi-directional Connections for Encoder-Decoder Architecture

Hierarchical Parameter Sharing In Recursive Neural Networks With Long Short-Term Memory

A New Hybrid-Parameter Recurrent Neural Network for Online Handwritten Chinese Character Recognition

DRRNets: Dynamic Recurrent Routing Via Low-Rank Regularization in Recurrent Neural Networks.

Biologically Inspired Structure Learning with Reverse Knowledge Distillation for Spiking Neural Networks

IM-LIF: Improved Neuronal Dynamics with Attention Mechanism for Direct Training Deep Spiking Neural Network

Deep RNN Framework for Visual Sequential Applications

DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs

GLIF: A Unified Gated Leaky Integrate-and-Fire Neuron for Spiking Neural Networks

Biologically Inspired Heterogeneous Learning for Accurate, Efficient and Low-Latency Neural Network

Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks

Parameters Sharing in Residual Neural Networks

On extended long short-term memory and dependent bidirectional recurrent neural network

Hierarchically Gated Recurrent Neural Network for Sequence Modeling

Recurrently Controlled Recurrent Networks

Long Short-Term Memory with Dynamic Skip Connections.

Loop Neural Networks for Parameter Sharing

An Improved Time Feedforward Connections Recurrent Neural Networks