Abstract: Background: An important problem in selective attention is determining how the primary visual cortex contributes to the encoding of bottom-up saliency and which types of neural computation model this process effectively. To address this problem, we constructed a two-layered network that satisfies the neurobiological constraints of the primary visual cortex to detect salient objects. We carried out experiments on both synthetic and natural images to explore how different factors, such as network structure, the size of each layer, the type of suppression and the combination strategy, influence saliency detection performance.

Results: The experimental results statistically demonstrated that the type and scale of filters contribute greatly to the encoding of bottom-up saliency. These two factors correspond to the mechanisms of invariant encoding and overcomplete representation in the primary visual cortex.

Conclusions: (1) Instead of constructing Gabor functions or Gaussian pyramid filters for feature extraction as traditional attention models do, we learn overcomplete basis sets from natural images to extract features for saliency detection. Experiments show that, given a proper layer size and a robust combination strategy, the learned overcomplete basis set outperforms a complete set and Gabor pyramids in visual saliency detection. This finding can potentially be applied in task-dependent and supervised object detection. (2) A hierarchical coding model that can represent invariant features is designed for the pre-attentive stage of bottom-up attention. This coding model improves robustness to noise and distractions and improves the ability to detect salient structures, such as collinear and co-circular structures, and several composite stimuli. This result indicates that invariant representation contributes to saliency detection (popping out) in bottom-up attention.
These findings will contribute significantly to an in-depth understanding of the information processing mechanisms in the primary visual system.

Improvement of Salient-Region Detection Using an Integrated Bottom-Up Model

Salient Region Detection by Fusing Foreground and Background Cues Extracted from Single Image

A visual attention model combining top-down and bottom-up mechanisms for salient object detection

A Modified Selective Attention Model for Salient Region Detection in Real Scenes.

Salient region detection using high level feature

An Improved Saliency Detection Algorithm Based on Itti's Model

Visual Saliency Detection Based on Topographic Independent Component Analysis

Saliency Based on Cortex-Like Mechanisms.

Improving Bottom-up Saliency Detection by Looking into Neighbors

Modeling Bottom-Up Visual Attention for Color Images.

Salient region detection by fusing bottom-up and top-down features extracted from a single image.

Tag-Saliency: Combining Bottom-Up and Top-Down Information for Saliency Detection

A Visual Attention Based Object Detection Model beyond Top-Down and Bottom-up Mechanism

Salient region detection: Integrate both global and local cues

A neural computational model for bottom-up attention with invariant and overcomplete representation

Integrating Bottom-Up and Top-Down Visual Stimulus for Saliency Detection in News Video

MSGC: A New Bottom-Up Model for Salient Object Detection

Visual Attention Model with Cross-Layer Saliency Optimization

A biologically inspired computational model for image saliency detection.

Biologically Motivated Object Detection Based On Integration Of Top-Down And Bottom-Up Attention

A Visual Attention Model for Natural Scenes Based on Dynamic Feature Combination