Efficient Online Learning for Networks of Two-Compartment Spiking Neurons

Yujia Yin,Xinyi Chen,Chenxiang Ma,Jibin Wu,Kay Chen Tan
2024-02-25
Abstract:The brain-inspired Spiking Neural Networks (SNNs) have garnered considerable research interest due to their superior performance and energy efficiency in processing temporal signals. Recently, a novel multi-compartment spiking neuron model, namely the Two-Compartment LIF (TC-LIF) model, has been proposed and exhibited a remarkable capacity for sequential modelling. However, training the TC-LIF model presents challenges stemming from the large memory consumption and the issue of gradient vanishing associated with the Backpropagation Through Time (BPTT) algorithm. To address these challenges, online learning methodologies emerge as a promising solution. Yet, to date, the application of online learning methods in SNNs has been predominantly confined to simplified Leaky Integrate-and-Fire (LIF) neuron models. In this paper, we present a novel online learning method specifically tailored for networks of TC-LIF neurons. Additionally, we propose a refined TC-LIF neuron model called Adaptive TC-LIF, which is carefully designed to enhance temporal information integration in online learning scenarios. Extensive experiments, conducted on various sequential benchmarks, demonstrate that our approach successfully preserves the superior sequential modeling capabilities of the TC-LIF neuron while incorporating the training efficiency and hardware friendliness of online learning. As a result, it offers a multitude of opportunities to leverage neuromorphic solutions for processing temporal signals.
Neural and Evolutionary Computing
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the challenges faced by the existing spiking neural networks (SNNs) based on the two - compartment leaky integrate - and - fire (TC - LIF) neuron model when trained using the traditional backpropagation through time (BPTT) algorithm. Specifically, these challenges include: 1. **High memory consumption**: The BPTT algorithm needs to store all internal states within the entire time window for gradient update, which leads to a linearly increasing memory requirement as the time window increases. 2. **Vanishing gradient problem**: The BPTT algorithm is prone to the problem of vanishing or exploding gradients, which complicates stable training and requires careful parameter initialization. 3. **Not suitable for online learning**: The BPTT algorithm requires that the entire sequence be available before training can begin, so it is not suitable for online learning scenarios. To solve these problems, the authors propose a new online learning method, which is specifically optimized for TC - LIF neuron networks. In addition, they also propose an improved TC - LIF neuron model, called Adaptive TC - LIF, which aims to enhance the ability to integrate temporal information in online learning scenarios. The following are the specific contributions of this study: 1. **Extension of the e - prop algorithm**: The e - prop online learning method originally designed for LIF neurons is extended to multi - compartment TC - LIF neurons, and detailed mathematical derivations are provided. 2. **Introduction of the Adaptive TC - LIF model**: A new model that includes a time - varying membrane potential decay constant is proposed to improve the ability to integrate temporal information in online learning. 3. **Experimental verification**: Through a series of experiments on sequential modeling tasks, it is proved that the proposed Adaptive TC - LIF model has excellent sequential modeling ability, efficient training performance, and is friendly to neuromorphic hardware. ### Formula summary 1. **Dynamics equation of TC - LIF neurons**: \[ v_D^j[t + 1]=v_D^j[t]+\beta_1 v_S^j[t]-\gamma z^j[t]+I^j[t + 1] \] \[ v_S^j[t + 1]=v_S^j[t]+\beta_2 v_D^j[t + 1]-v_{th}z^j[t] \] \[ I^j[t + 1]=\sum_{i\neq j}W_{rec}^{ji}z^i[t]+\sum_i W_{in}^{ji}x^i[t + 1]+b \] \[ z^j[t + 1]=H(v_S^j[t + 1]-v_{th}) \] 2. **Eligibility Trace in the e - prop algorithm**: \[ e^{ji}[t]=\frac{\partial z^j[t]}{\partial h^j[t]}\cdot\varepsilon^{ji}[t] \] where, \[ \varepsilon^{ji}[t]=\frac{\partial h^j[t]}{\partial W^{ji}}+\frac{\partial h^j[t]}{\partial h^j[t - 1]}\cdot\varepsilon^{ji}[t - 1] \] 3. **Dynamics equation of the Adaptive TC - LIF model**: \[ v_D^j[t + 1]=A_D[t]\cdot v_D^j[t]+\beta_1 v_S^j[t]-\gamma z^j[t]+I^j[t + 1] \] \[ v_S^j[t + 1]=A_S[t]\cdot v_S^j[t]+\beta_2 v_D^j[t + 1]-v_{th}z^j[t] \] where, \[ A_D[t]=\t