Abstract:A defining characteristic of intelligent systems, whether natural or artificial, is the ability to generalize and infer behaviorally relevant latent causes from high-dimensional sensory input, despite significant variations in the environment. To understand how brains achieve generalization, it is crucial to identify the features to which neurons respond selectively and invariantly. However, the high-dimensional nature of visual inputs, the non-linearity of information processing in the brain, and limited experimental time make it challenging to systematically characterize neuronal tuning and invariances, especially for natural stimuli. Here, we extended "inception loops" - a paradigm that iterates between large-scale recordings, neural predictive models, and in silico experiments followed by in vivo verification - to systematically characterize single neuron invariances in the mouse primary visual cortex. Using the predictive model we synthesized Diverse Exciting Inputs (DEIs), a set of inputs that differ substantially from each other while each driving a target neuron strongly, and verified these DEIs' efficacy in vivo. We discovered a novel bipartite invariance: one portion of the receptive field encoded phase-invariant texture-like patterns, while the other portion encoded a fixed spatial pattern. Our analysis revealed that the division between the fixed and invariant portions of the receptive fields aligns with object boundaries defined by spatial frequency differences present in highly activating natural images. These findings suggest that bipartite invariance might play a role in segmentation by detecting texture-defined object boundaries, independent of the phase of the texture. We also replicated these bipartite DEIs in the functional connectomics MICrONs data set, which opens the way towards a circuit-level mechanistic understanding of this novel type of invariance. Our study demonstrates the power of using a data-driven deep learning approach to systematically characterize neuronal invariances. By applying this method across the visual hierarchy, cell types, and sensory modalities, we can decipher how latent variables are robustly extracted from natural scenes, leading to a deeper understanding of generalization.

Invariant Visual Object Recognition: Biologically Plausible Approaches

Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet

A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

Non-accidental Properties, Metric Invariance, and Encoding by Neurons in a Model of Ventral Stream Visual Object Recognition, VisNet

Invariant Object Recognition in the Visual System with Novel Views of 3D Objects

Invariant face and object recognition in the visual system.

Invariant recognition of feature combinations in the visual system

Invariant visual object recognition: A model, with lighting invariance

A Model of Invariant Object Recognition in the Visual System

Learning Invariant Object and Spatial View Representations in the Brain Using Slow Unsupervised Learning

A Neurophysiological and Computational Approach to the Functions of the Temporal Lobe Cortical Visual Areas in Invariant Object Recognition

Learning Transform Invariant Object Recognition in the Visual System with Multiple Stimuli Present During Training.

Position Invariant Recognition In The Visual System With Cluttered Environments

A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition

Invariant Object Recognition with Trace Learning and Multiple Stimuli Present During Training

Models of invariant object recognition

Invariant Object Recognition in the Visual System with Error Correction and Temporal Difference Learning

View-invariant representations of familiar objects by neurons in the inferior temporal visual cortex.

Bipartite invariance in mouse primary visual cortex

Humans and deep networks largely agree on which kinds of variation make object recognition harder

Building of Object View Invariance in a Newly-Discovered Network in Inferior Temporal Cortex