Abstract: First, neurophysiological evidence for the learning of invariant representations in the inferior temporal visual cortex is described. This includes object and face representations with invariance for position, size, lighting, view and morphological transforms in the temporal lobe visual cortex; global object motion in the cortex in the superior temporal sulcus; and spatial view representations in the hippocampus that are invariant with respect to eye position, head direction, and place. Second, computational mechanisms that enable the brain to learn these invariant representations are proposed. For the ventral visual system, one key adaptation is the use of information available in the statistics of the environment in slow unsupervised learning to learn transform-invariant representations of objects. This contrasts with deep supervised learning in artificial neural networks, which uses training with thousands of exemplars forced into different categories by neuronal teachers. Similar slow learning principles apply to the learning of global object motion in the dorsal visual system leading to the cortex in the superior temporal sulcus. The learning rule that has been explored in VisNet is an associative rule with a short-term memory trace. The feed-forward architecture has four stages, with convergence from stage to stage. This type of slow learning is implemented in the brain in hierarchically organized competitive neuronal networks with convergence from stage to stage, with only 4-5 stages in the hierarchy. Slow learning is also shown to help the learning of coordinate transforms using gain modulation in the dorsal visual system extending into the parietal cortex and retrosplenial cortex. Representations are learned that are in allocentric spatial view coordinates of locations in the world and that are independent of eye position, head direction, and the place where the individual is located. This enables hippocampal spatial view cells to use idiothetic (self-motion) signals for navigation when the view details are obscured for short periods.
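The associative rule with a short-term memory trace mentioned above can be illustrated with a minimal sketch. The abstract does not give the equations, so everything specific here is an assumption made for illustration: the exponential form of the trace, the parameter names `alpha` (learning rate) and `eta` (trace persistence), the weight normalisation, the single rectified-linear neuron, and the toy one-hot stimuli. It is not the VisNet implementation. The point it shows is that when transforms of one object follow each other closely in time, a lingering trace of earlier activity lets the neuron strengthen its synapses onto later transforms even though they do not yet drive it.

```python
import numpy as np


def trace_update(w, x, y, y_trace_prev, alpha=0.1, eta=0.8):
    """One step of an associative rule with a short-term memory trace.

    Assumed form (illustrative only):
        y_trace = (1 - eta) * y + eta * y_trace_prev
        w       = normalise(w + alpha * y_trace * x)
    """
    y_trace = (1.0 - eta) * y + eta * y_trace_prev
    w = w + alpha * y_trace * x
    w = w / np.linalg.norm(w)  # keep the weight vector at unit length
    return w, y_trace


def train(eta, epochs=5, alpha=0.1, n_transforms=3):
    """Train one neuron on two toy objects, each shown as a sequence of transforms."""
    # Hypothetical stimuli: every transform of every object activates one input line.
    stimuli = np.eye(2 * n_transforms)
    objects = [stimuli[:n_transforms], stimuli[n_transforms:]]

    w = np.zeros(2 * n_transforms)
    w[0] = 1.0  # the neuron starts out tuned to object A, transform 0 only

    for _ in range(epochs):
        for transforms in objects:
            y_trace = 0.0  # assume the trace has decayed between objects
            for x in transforms:
                y = max(0.0, float(w @ x))  # rectified-linear firing rate
                w, y_trace = trace_update(w, x, y, y_trace, alpha, eta)
    return w


w_trace = train(eta=0.8)  # with a memory trace
w_hebb = train(eta=0.0)   # eta = 0 reduces the rule to a plain Hebbian update

print("trace rule weights:", np.round(w_trace, 3))
print("Hebbian weights:   ", np.round(w_hebb, 3))
```

In this toy setting the trace-trained neuron ends up with positive weights onto all three transforms of object A and zero weights onto object B, whereas with `eta=0.0` only the transform it already responded to is strengthened. Resetting the trace between objects is a simplification; it stands in for the assumption that different objects are usually separated in time more than different transforms of the same object.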

Learning Invariant Object Recognition in the Visual System with Continuous Transformations

Continuous Transformation Learning of Translation Invariant Representations

Spatial vs temporal continuity in view invariant visual object recognition learning.

Learning Transform Invariant Object Recognition in the Visual System with Multiple Stimuli Present During Training.

Learning Invariant Responses to the Natural Transformations of Objects

Learning Invariant Object and Spatial View Representations in the Brain Using Slow Unsupervised Learning

Invariant visual object recognition: A model, with lighting invariance

Invariant Object Recognition in the Visual System with Novel Views of 3D Objects

Invariant Object Recognition in the Visual System with Error Correction and Temporal Difference Learning

Invariant face and object recognition in the visual system.

A Model of Invariant Object Recognition in the Visual System

Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet

Invariant Object Recognition with Trace Learning and Multiple Stimuli Present During Training

Learning mechanisms in the temporal lobe visual cortex

Invariant recognition of feature combinations in the visual system

A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

Spatial Scene Representations Formed by Self‐organizing Learning in a Hippocampal Extension of the Ventral Visual System

Transform-Invariant Recognition by Association in a Recurrent Network

Invariant global motion recognition in the dorsal visual system: a unifying theory

Invariant Visual Object Recognition: Biologically Plausible Approaches

Deformation-specific and Deformation-Invariant Visual Object Recognition: Pose Vs. Identity Recognition of People and Deforming Objects.