Abstract: Background: An important problem in selective attention is determining how the primary visual cortex contributes to the encoding of bottom-up saliency and which types of neural computation model this process effectively. To address this problem, we constructed a two-layered network that satisfies the neurobiological constraints of the primary visual cortex to detect salient objects. We carried out experiments on both synthetic and natural images to explore how different factors, such as network structure, the size of each layer, the type of suppression, and the combination strategy, influence saliency detection performance. Results: The experimental results statistically demonstrated that the type and scale of filters contribute greatly to the encoding of bottom-up saliency. These two factors correspond to the mechanisms of invariant encoding and overcomplete representation in the primary visual cortex. Conclusions: (1) Instead of constructing Gabor functions or Gaussian pyramid filters for feature extraction as traditional attention models do, we learn overcomplete basis sets from natural images to extract features for saliency detection. Experiments show that, given a proper layer size and a robust combination strategy, the learned overcomplete basis set outperforms a complete set and Gabor pyramids in visual saliency detection. This finding can potentially be applied in task-dependent and supervised object detection. (2) A hierarchical coding model that can represent invariant features is designed for the pre-attentive stage of bottom-up attention. This coding model improves robustness to noise and distractions and improves the detection of salient structures, such as collinear and co-circular structures, and several composite stimuli. This result indicates that invariant representation contributes to saliency detection (popping out) in bottom-up attention.
The aforementioned perspectives will significantly contribute to the in-depth understanding of the information processing mechanism in the primary visual system.

Modeling Bottom-Up and Top-Down Attention with a Neurodynamic Model of V1

A Hierarchical Model of Visual Processing Simulates Neural Mechanisms Underlying Reflexive Attention.

A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition

A neural computational model for bottom-up attention with invariant and overcomplete representation

A Neurodynamical Theory Of Visual Attention: Comparisons With fMRI- And Single-Neuron Data

Hebbian-Based Neural Networks for Bottom-Up Visual Attention Systems

Network Model Of Top-Down Influences On Local Gain And Contextual Interactions In Visual Cortex

Computational modelling of visual attention

The cortical neurodynamics of visual attention - a model

Modeling Attention and Binding in the Brain through Bidirectional Recurrent Gating

Visual Attention is Beyond One Single Saliency Map

A Novel Method to Study Bottom-up Visual Saliency and its Neural Mechanism

A cortical model with multi-layers to study visual attentional modulation of neurons at the synaptic level

Visual Attention Model Based on Statistical Properties of Neuron Responses.

Neural activities in V1 create a bottom-up saliency map.

A model of saliency-based visual attention for rapid scene analysis

Cognitive Neural Mechanisms and Saliency Computational Model of Visual Selective Attention

A Neural Network Model of Visual Attention Integrating Biased Competition and Reinforcement Learning

Top-Down Priors Disambiguate Target and Distractor Features in Simulated Covert Visual Search

A conceptual frame with two neural mechanisms to model selective visual attention processes

Flexible top-down modulation in human ventral temporal cortex