ELiSe: Efficient Learning of Sequences in Structured Recurrent Networks

Laura Kriener,Kristin Völk,Ben von Hünerbein,Federico Benitez,Walter Senn,Mihai A. Petrovici
2024-09-27
Abstract:Behavior can be described as a temporal sequence of actions driven by neural activity. To learn complex sequential patterns in neural networks, memories of past activities need to persist on significantly longer timescales than the relaxation times of single-neuron activity. While recurrent networks can produce such long transients, training these networks is a challenge. Learning via error propagation confers models such as FORCE, RTRL or BPTT a significant functional advantage, but at the expense of biological plausibility. While reservoir computing circumvents this issue by learning only the readout weights, it does not scale well with problem complexity. We propose that two prominent structural features of cortical networks can alleviate these issues: the presence of a certain network scaffold at the onset of learning and the existence of dendritic compartments for enhancing neuronal information storage and computation. Our resulting model for Efficient Learning of Sequences (ELiSe) builds on these features to acquire and replay complex non-Markovian spatio-temporal patterns using only local, always-on and phase-free synaptic plasticity. We showcase the capabilities of ELiSe in a mock-up of birdsong learning, and demonstrate its flexibility with respect to parametrization, as well as its robustness to external disturbances.
Neurons and Cognition,Artificial Intelligence,Neural and Evolutionary Computing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to learn complex spatio - temporal sequences efficiently and biologically plausibly in neural networks. Specifically, the authors propose a new model - Efficient Learning of Sequences (ELiSe), aiming to overcome the challenges encountered by existing models when learning complex sequences. These challenges mainly include: 1. **Gap in time scales**: Behavior occurs on a time scale of seconds, while neural activity changes on a time scale of milliseconds. In order to learn complex sequence patterns, it is necessary to remember past activities, and these memories need to last for a much longer time scale than the relaxation time of a single neuron's activity. 2. **Challenges in training**: Although recurrent networks can produce the required long - time transients, training these networks is a difficult problem. Methods of learning through error propagation (such as FORCE, RTRL or BPTT) are powerful, but have flaws in terms of biological plausibility. For example, these methods usually require global error signals or non - local quantities, which are difficult to achieve biologically. 3. **Resource efficiency**: Existing methods, such as reservoir computing, although avoiding the problem of biological plausibility, their performance will decline significantly as the complexity of the problem increases, because reservoir computing depends on the emergence of useful transients in random activities, which is difficult to guarantee in complex tasks. To solve the above problems, the ELiSe model introduces two main structural features: - **Existence of network scaffolds**: Before the start of learning, there is already a specific structural scaffold in the network, which helps to propagate teaching signals throughout the network in a structured manner. - **Dendritic compartments**: Enhance the information storage and computing capabilities of individual neurons, so that the learning mechanism can use only local, always - on and non - phasic synaptic plasticity rules. Through these improvements, the ELiSe model can efficiently learn and replay complex non - Markovian sequences with a small number of neurons, and is robust to parameter changes and external disturbances.