Abstract: When objects transform into different views, some properties are maintained, such as whether the edges are convex or concave, and these non-accidental properties are likely to be important in view-invariant object recognition. The metric properties, such as the degree of curvature, may change with different views, and are less likely to be useful in object recognition. It is shown that in VisNet, a model of invariant visual object recognition in the ventral visual stream, neurons encode non-accidental properties much more than metric properties. Moreover, it is shown how, with temporal trace rule training in VisNet, non-accidental properties of objects become encoded by neurons, and how metric properties are treated invariantly. We also show how VisNet can generalize between different objects if they have the same non-accidental property, because the metric properties are likely to overlap. VisNet is a 4-layer unsupervised model of visual object recognition trained by competitive learning; it uses a temporal trace learning rule to learn invariance from views that occur close together in time. A second crucial property of this model of object recognition concerns whether, when neurons in the level corresponding to the inferior temporal visual cortex respond selectively to objects, neurons in the intermediate layers can respond to combinations of features that may be parts of two or more objects. In an investigation using the four sides of a square presented in every possible combination, it was shown that even though different layer 4 neurons are tuned to encode each feature or feature combination orthogonally, neurons in the intermediate layers can respond to features or feature combinations present in several objects. This property is an important part of the way in which high capacity can be achieved in the four-layer ventral visual cortical pathway.
These findings concerning non-accidental properties and the use of neurons in intermediate layers of the hierarchy help to emphasise fundamental underlying principles of the computations that may be implemented in the ventral cortical visual stream used in object recognition.
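
The temporal trace learning rule mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the equations, so the form used here is an assumption: a trace ȳ(t) = (1 − η)·y(t) + η·ȳ(t−1) of the neuron's recent activity, a Hebbian update δw = α·ȳ(t)·x(t), and weight-vector normalization after each step. The function names, the parameter values for η and α, and the toy four-input "views" are hypothetical, and the competition between neurons that the full model uses is omitted.

```python
import math


def trace_update(weights, trace_prev, x, eta=0.8, alpha=0.1):
    """One trace-rule step for a single neuron (illustrative assumption).

    weights    : current synaptic weights (list of floats)
    trace_prev : trace of the neuron's activity from the previous step
    x          : current presynaptic input vector (list of floats)
    Returns (new_weights, new_trace).
    """
    # Activation as a plain dot product; lateral competition is omitted.
    y = sum(w * xi for w, xi in zip(weights, x))
    # Exponentially decaying trace: mixes current and earlier activity.
    trace = (1.0 - eta) * y + eta * trace_prev
    # Hebbian update driven by the trace rather than the instantaneous rate.
    updated = [w + alpha * trace * xi for w, xi in zip(weights, x)]
    # Normalize the weight vector to unit length to keep weights bounded.
    norm = math.sqrt(sum(w * w for w in updated))
    if norm > 0.0:
        updated = [w / norm for w in updated]
    return updated, trace


def train_on_views(views, n_inputs, epochs=20):
    """Present the views of ONE object in temporal sequence, repeatedly."""
    weights = [0.5] * n_inputs
    trace = 0.0  # in a multi-object setting, reset the trace between objects
    for _ in range(epochs):
        for view in views:
            weights, trace = trace_update(weights, trace, view)
    return weights


# Two hypothetical "views" of the same object activate different inputs.
view_a = [1.0, 0.0, 0.0, 0.0]
view_b = [0.0, 1.0, 0.0, 0.0]
learned = train_on_views([view_a, view_b], n_inputs=4)
```

Because the trace carries activity over from one view to the next, the synapses driven by both views are strengthened on the same neuron, while the unused inputs shrink under normalization. This is only meant to show why views that occur close together in time become associated.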

Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet

Invariant Visual Object Recognition: Biologically Plausible Approaches

Invariant face and object recognition in the visual system

Invariant visual object recognition: A model, with lighting invariance

A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

A Model of Invariant Object Recognition in the Visual System

Non-accidental Properties, Metric Invariance, and Encoding by Neurons in a Model of Ventral Stream Visual Object Recognition, VisNet

Invariant Object Recognition in the Visual System with Novel Views of 3D Objects

Learning Invariant Object and Spatial View Representations in the Brain Using Slow Unsupervised Learning

A Neurophysiological and Computational Approach to the Functions of the Temporal Lobe Cortical Visual Areas in Invariant Object Recognition

A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition

Learning mechanisms in the temporal lobe visual cortex

Learning Transform Invariant Object Recognition in the Visual System with Multiple Stimuli Present During Training

Invariant recognition of feature combinations in the visual system

Models of invariant object recognition

Spatial Scene Representations Formed by Self‐organizing Learning in a Hippocampal Extension of the Ventral Visual System

Invariant Object Recognition with Trace Learning and Multiple Stimuli Present During Training

Finding and Recognizing Objects in Natural Scenes: Complementary Computations in the Dorsal and Ventral Visual Systems

The representation of information about faces in the temporal and frontal lobes

Vision, Emotion and Memory: from Neurophysiology to Computation

The Neurophysiology and Computational Mechanisms of Object Representation