Abstract:Abstract There are two distinct classes of cells in the primary visual cortex (V1): simple cells and complex cells. One defining feature of complex cells is their spatial phase invariance; they respond strongly to oriented grating stimuli with a preferred orientation but with a wide range of spatial phases. A classical model of complete spatial phase invariance in complex cells is the energy model, in which the responses are the sum of the squared outputs of two linear spatially phase-shifted filters. However, recent experimental studies have shown that complex cells have a diverse range of spatial phase invariance and only a subset can be characterized by the energy model. While several models have been proposed to explain how complex cells could learn to be selective to orientation but invariant to spatial phase, most existing models overlook many biologically important details. We propose a biologically plausible model for complex cells that learns to pool inputs from simple cells based on the presentation of natural scene stimuli. The model is a three-layer network with rate-based neurons that describes the activities of LGN cells (layer 1), V1 simple cells (layer 2), and V1 complex cells (layer 3). The first two layers implement a recently proposed simple cell model that is biologically plausible and accounts for many experimental phenomena. The neural dynamics of the complex cells is modeled as the integration of simple cells inputs along with response normalization. Connections between LGN and simple cells are learned using Hebbian and anti-Hebbian plasticity. Connections between simple and complex cells are learned using a modified version of the Bienenstock, Cooper, and Munro (BCM) rule. Our results demonstrate that the learning rule can describe a diversity of complex cells, similar to those observed experimentally. Author summary Many cortical functions originate from the learning ability of the brain. How the properties of cortical cells are learned is vital for understanding how the brain works. There are many models that explain how V1 simple cells can be learned. However, how V1 complex cells are learned still remains unclear. In this paper, we propose a model of learning in complex cells based on the Bienenstock, Cooper, and Munro (BCM) rule. We demonstrate that properties of receptive fields of complex cells can be learned using this biologically plausible learning rule. Quantitative comparisons between the model and experimental data are performed. Results show that model complex cells can account for the diversity of complex cells found in experimental studies. In summary, this study provides a plausible explanation for how complex cells can be learned using biologically plausible plasticity mechanisms. Our findings help us to better understand biological vision processing and provide us with insights into the general signal processing principles that the visual cortex employs to process visual information.

Learning V1 Simple Cells with Vector Representation of Local Content and Matrix Representation of Local Motion

Learning vector representation of local content and matrix representation of local motion, with implications for V1

Leveraging Local Planar Motion Property for Robust Visual Matching and Localization.

Recurrent Volume-Based 3-D Feature Fusion for Real-Time Multiview Object Pose Estimation.

3D Model-free Visual Localization System from Essential Matrix under Local Planar Motion

Recurrent Volume-based 3D Feature Fusion for Real-time Multi-view Object Pose Estimation

Learning Grid Cells as Vector Representation of Self-Position Coupled with Matrix Representation of Self-Motion

Grid Cell Modeling with Mapping Representation of Self-Motion for Path Integration

Egg Donor Informed Consent Tool (EDICT): development and validation of a new informed consent tool for oocyte donors.

Learning intermediate-level representations of form and motion from natural movies

Emergence of Complex-Like Cells in a Temporal Product Network with Local Receptive Fields

Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis

Modelling Human Visual Motion Processing with Trainable Motion Energy Sensing and a Self-attention Network

Learning receptive field properties of complex cells in V1

Multi-Hypothesis Visual-Inertial Flow

GaborNet Visual Encoding: A Lightweight Region-Based Visual Encoding Model With Good Expressiveness and Biological Interpretability

Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

A Representational Model of Grid Cells Based on Matrix Lie Algebras.

Simple and complex cells revisited: toward a selectivity-invariance model of object recognition

RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo