Abstract:Many cognitive problems can be decomposed into series of subproblems that are solved sequentially by the brain. When subproblems are solved, relevant intermediate results need to be stored by neurons and propagated to the next subproblem, until the overarching goal has been completed. We will here consider visual tasks, which can be decomposed into sequences of elemental visual operations. Experimental evidence suggests that intermediate results of the elemental operations are stored in working memory as an enhancement of neural activity in the visual cortex. The focus of enhanced activity is then available for subsequent operations to act upon. The main question at stake is how the elemental operations and their sequencing can emerge in neural networks that are trained with only rewards, in a reinforcement learning setting. We here propose a new recurrent neural network architecture that can learn composite visual tasks that require the application of successive elemental operations. Specifically, we selected three tasks for which electrophysiological recordings of monkeys' visual cortex are available. To train the networks, we used RELEARNN, a biologically plausible four-factor Hebbian learning rule, which is local both in time and space. We report that networks learn elemental operations, such as contour grouping and visual search, and execute sequences of operations, solely based on the characteristics of the visual stimuli and the reward structure of a task. After training was completed, the activity of the units of the neural network elicited by behaviorally relevant image items was stronger than that elicited by irrelevant ones, just as has been observed in the visual cortex of monkeys solving the same tasks. Relevant information that needed to be exchanged between subroutines was maintained as a focus of enhanced activity and passed on to the subsequent subroutines. Our results demonstrate how a biologically plausible learning rule can train a recurrent neural network on multistep visual tasks. Many visual problems, like finding your way on a map, are solved by decomposing them into a series of subproblems. For a successful decomposition, the neuronal processes that solve one subproblem must make their results available to the subsequent ones. Experiments in monkeys demonstrated that the outcomes of subproblems are represented as foci of enhanced activity in the visual cortex, which are related to attention shifts and can be used as inputs for the processes solving the next subproblems. To understand how subproblems and their sequencing are learned, we trained a recurrent artificial neural network on the same tasks that monkeys performed, using a biologically plausible reinforcement learning rule. The networks learned the tasks and, importantly, the activation of the networks' units resembled the spatiotemporal patterns of activity observed in the visual cortex of monkeys. Our results shed light on how recurrent neural networks trained with a biologically plausible learning rule can learn to propagate enhanced activity between subroutines to solve complex visual tasks.

Recurrent Joint Embedding Predictive Architecture with Recurrent Forward Propagation Learning

Long-Term Recurrent Convolutional Networks for Visual Recognition and Description

Recurrent networks improve neural response prediction and provide insights into underlying cortical circuits

Recurrent Models of Visual Attention

Recurrent Feedback Improves Feedforward Representations in Deep Neural Networks

Recurrent network dynamics reconciles visual motion segmentation and integration

Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial Noises

Embodied Neuromorphic Vision with Event-Driven Random Backpropagation

Human Eyes-Inspired Recurrent Neural Networks Are More Robust Against Adversarial Noises

ViR: Towards Efficient Vision Retention Backbones

Task-Driven Convolutional Recurrent Models of the Visual System

Progressive Recurrent Learning for Visual Recognition.

Recurrent issues with deep neural networks of visual recognition

Recurrent neural networks that learn multi-step visual routines with reinforcement learning

Reproducibility of predictive networks for mouse visual cortex

Biologically Inspired Model for Visual Cognition Achieving Unsupervised Episodic and Semantic Feature Learning.

Learning Spatial Common Sense with Geometry-Aware Recurrent Networks

Learning and Unlearning: Bridging classification, memory and generative modeling in Recurrent Neural Networks

Recurrent Neural Network for Learning DenseDepth and Ego-Motion from Video

Recurrent Feedback Improves Recognition of Partially Occluded Objects

Unraveling Neural Coding of Dynamic Natural Visual Scenes Via Convolutional Recurrent Neural Networks.