Abstract:When objects transform into different views, some properties are maintained, such as whether the edges are convex or concave, and these non-accidental properties are likely to be important in view-invariant object recognition. The metric properties, such as the degree of curvature, may change with different views, and are less likely to be useful in object recognition. It is shown that in a model of invariant visual object recognition in the ventral visual stream, VisNet, non-accidental properties are encoded much more than metric properties by neurons. Moreover, it is shown how with the temporal trace rule training in VisNet, non-accidental properties of objects become encoded by neurons, and how metric properties are treated invariantly. We also show how VisNet can generalize between different objects if they have the same non-accidental property, because the metric properties are likely to overlap. VisNet is a 4-layer unsupervised model of visual object recognition trained by competitive learning that utilizes a temporal trace learning rule to implement the learning of invariance using views that occur close together in time. A second crucial property of this model of object recognition is, when neurons in the level corresponding to the inferior temporal visual cortex respond selectively to objects, whether neurons in the intermediate layers can respond to combinations of features that may be parts of two or more objects. In an investigation using the four sides of a square presented in every possible combination, it was shown that even though different layer 4 neurons are tuned to encode each feature or feature combination orthogonally, neurons in the intermediate layers can respond to features or feature combinations present is several objects. This property is an important part of the way in which high capacity can be achieved in the four-layer ventral visual cortical pathway. These findings concerning non-accidental properties and the use of neurons in intermediate layers of the hierarchy help to emphasise fundamental underlying principles of the computations that may be implemented in the ventral cortical visual stream used in object recognition.

Finding and Recognizing Objects in Natural Scenes: Complementary Computations in the Dorsal and Ventral Visual Systems

A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition

Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet

Effective Size of Receptive Fields of Inferior Temporal Visual Cortex Neurons in Natural Scenes.

Spatial Scene Representations Formed by Self‐organizing Learning in a Hippocampal Extension of the Ventral Visual System

Effective Size Of Receptive Fields Of Inferior Temporal Visual Cortex Neurons In Natural Scenes

Invariant Visual Object Recognition: Biologically Plausible Approaches

Attention in Natural Scenes: Neurophysiological and Computational Bases

Object Detection Based on Saturation of Visual Perception

A Neurodynamical Theory Of Visual Attention: Comparisons With Fmri- And Single-Neuron Data

A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

Invariant visual object recognition: A model, with lighting invariance

A Neurophysiological and Computational Approach to the Functions of the Temporal Lobe Cortical Visual Areas in Invariant Object Recognition

Object recognition in primates: What can early visual areas contribute?

Invariant Object Recognition in the Visual System with Novel Views of 3D Objects

Non-accidental Properties, Metric Invariance, and Encoding by Neurons in a Model of Ventral Stream Visual Object Recognition, VisNet

A Model of Invariant Object Recognition in the Visual System

The cortical neurodynamics of visual attention - a model

A brain-inspired object-based attention network for multiobject recognition and visual reasoning

Fundamental principles of cortical computation: unsupervised learning with prediction, compression and feedback