Abstract:First, neurophysiological evidence for the learning of invariant representations in the inferior temporal visual cortex is described. This includes object and face representations with invariance for position, size, lighting, view and morphological transforms in the temporal lobe visual cortex; global object motion in the cortex in the superior temporal sulcus; and spatial view representations in the hippocampus that are invariant with respect to eye position, head direction, and place. Second, computational mechanisms that enable the brain to learn these invariant representations are proposed. For the ventral visual system, one key adaptation is the use of information available in the statistics of the environment in slow unsupervised learning to learn transform-invariant representations of objects. This contrasts with deep supervised learning in artificial neural networks, which uses training with thousands of exemplars forced into different categories by neuronal teachers. Similar slow learning principles apply to the learning of global object motion in the dorsal visual system leading to the cortex in the superior temporal sulcus. The learning rule that has been explored in VisNet is an associative rule with a short-term memory trace. The feed-forward architecture has four stages, with convergence from stage to stage. This type of slow learning is implemented in the brain in hierarchically organized competitive neuronal networks with convergence from stage to stage, with only 4-5 stages in the hierarchy. Slow learning is also shown to help the learning of coordinate transforms using gain modulation in the dorsal visual system extending into the parietal cortex and retrosplenial cortex. Representations are learned that are in allocentric spatial view coordinates of locations in the world and that are independent of eye position, head direction, and the place where the individual is located. This enables hippocampal spatial view cells to use idiothetic, self-motion, signals for navigation when the view details are obscured for short periods.

Invariant global motion recognition in the dorsal visual system: a unifying theory

Invariant visual object recognition: A model, with lighting invariance

Transformation of Spatiotemporal Dynamics in the Macaque Vestibular System from Otolith Afferents to Cortex

Invariant face and object recognition in the visual system.

Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet

Learning Invariant Object and Spatial View Representations in the Brain Using Slow Unsupervised Learning

Invariant Object Recognition in the Visual System with Novel Views of 3D Objects

A Model of Invariant Object Recognition in the Visual System

A Theory of the Visual Motion Coding in the Primary Visual Cortex

Learning Visual Features Under Motion Invariance

Invariant Visual Object Recognition: Biologically Plausible Approaches

Learning Invariant Object Recognition in the Visual System with Continuous Transformations

Pattern motion representation in primary visual cortex is mediated by transcortical feedback

Learning Transform Invariant Object Recognition in the Visual System with Multiple Stimuli Present During Training.

Distributed and retinotopically asymmetric processing of coherent motion in mouse visual cortex

Invariant recognition of feature combinations in the visual system

Hierarchical motion perception as causal inference

Mechanisms of Adaptive Spatial Integration in a Neural Model of Cortical Motion Processing

Learning intermediate-level representations of form and motion from natural movies

Learning to segment self-generated from externally caused optic flow through sensorimotor mismatch circuits

A Circuit for Integration of Head- and Visual-Motion Signals in Layer 6 of Mouse Primary Visual Cortex