Abstract: Recurrent neural networks (RNNs) provide state-of-the-art performance in a wide variety of tasks that require memory. This performance can often be achieved thanks to gated recurrent cells such as gated recurrent units (GRU) and long short-term memory (LSTM). Standard gated cells share a layer-level internal state to store information at the network level, and long-term memory is shaped by network-wide recurrent connection weights. Biological neurons, on the other hand, are capable of holding information at the cellular level for an arbitrarily long time through a process called bistability. Through bistability, cells can settle into different stable states depending on their own past state and inputs, which permits the durable storage of past information in the neuron state. In this work, we take inspiration from biological neuron bistability to endow RNNs with long-lasting memory at the cellular level. This leads to the introduction of a new bistable, biologically inspired recurrent cell that is shown to strongly improve RNN performance on time series that require very long memory, despite using only cellular connections (all recurrent connections are from neurons to themselves, i.e. a neuron's state is not influenced by the state of other neurons). Furthermore, equipping this cell with recurrent neuromodulation makes it possible to link it to standard GRU cells, taking a step towards the biological plausibility of GRU. With this link, this work paves the way for studying more complex and biologically plausible neuromodulation schemes as gating mechanisms in RNNs.
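
The notion of cellular bistability described in the abstract can be illustrated with a toy example. The sketch below is not the cell proposed in this work (the abstract does not give its equations); it assumes a single neuron whose only recurrent connection is to itself, with a hypothetical update rule h_t = tanh(gain * h_{t-1} + x_t). The function name `run_cell` and the parameter `gain` are illustrative assumptions. When the self-feedback gain exceeds 1, the neuron has two stable states and remembers the sign of a brief input pulse; when the gain is below 1, the state simply decays back to zero.

```python
import math


def run_cell(inputs, gain, h0=0.0):
    """Iterate h_t = tanh(gain * h_{t-1} + x_t) for one self-connected neuron.

    Assumption: this update rule is a toy model chosen for illustration,
    not the recurrent cell introduced in the paper.
    """
    h = h0
    trace = []
    for x in inputs:
        h = math.tanh(gain * h + x)
        trace.append(h)
    return trace


# One brief pulse, followed by a long stretch with no input at all.
pos_pulse = [1.0] + [0.0] * 200
neg_pulse = [-1.0] + [0.0] * 200

# gain > 1: two stable fixed points, so the sign of the pulse is retained.
bistable_pos = run_cell(pos_pulse, gain=1.5)[-1]
bistable_neg = run_cell(neg_pulse, gain=1.5)[-1]

# gain < 1: a single stable fixed point at 0, so the pulse is forgotten.
leaky = run_cell(pos_pulse, gain=0.5)[-1]

print(f"gain=1.5, positive pulse: final state {bistable_pos:.3f}")
print(f"gain=1.5, negative pulse: final state {bistable_neg:.3f}")
print(f"gain=0.5, positive pulse: final state {leaky:.3f}")
```

With the larger gain, the final state should remain far from zero and carry the sign of the pulse even after 200 input-free steps, whereas with the smaller gain it should shrink towards zero. No other neuron is involved in either case, which is the sense in which the memory is held at the cellular level.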

Working Memory Connections for LSTM

LSTM with Working Memory

ELSTM: An improved long short‐term memory network language model for sequence learning

Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition

Learning to Forget: Continual Prediction with LSTM

Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

A bio-inspired bistable recurrent cell allows for long-lasting memory

Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks

A New Delay Connection for Long Short-Term Memory Networks

xLSTM: Extended Long Short-Term Memory

Learning Sparse Hidden States In Long Short-Term Memory

A Modified Long Short-Term Memory Cell

On extended long short-term memory and dependent bidirectional recurrent neural network

Gating Recurrent Enhanced Memory Neural Networks on Language Identification

Long short-term memory networks in memristor crossbars

Long short-term memory networks in memristor crossbar arrays

Semi-tied Units for Efficient Gating in LSTM and Highway Networks

Cortical microcircuits as gated-recurrent neural networks

Long Short-Term Graph Memory Against Class-imbalanced Over-smoothing

NEWLSTM: an Optimized Long Short-Term Memory Language Model for Sequence Prediction

Towards Binary-Valued Gates for Robust LSTM Training