Abstract:In order to solve the problems of video saliency detection and poor fusion effect, a video saliency detection model and a fusion model are proposed. Video saliency detection is divided into spatial saliency detection and temporal saliency detection. In the spatial domain, inspired by the properties of visual cortex hierarchical perception and the Gestalt visual psychology, we propose a hierarchical saliency detection model with three-layer architecture for single frame image. The video single frame is simplified layer by layer, then the results are combined to form a whole consciousness vision object and become easier to deal with. At the bottom of the model, candidate saliency regions are formed by nonlinear simplification model of the characteristic image (dual color characteristic and luminance characteristic image), which is in accordance with the biological visual characteristic. In the middle of the model, the candidate regions with the strongest competitiveness are selected as the local salient regions according to the property of matrix minimum Fresenius- norm (F- norm). At the top level of the model, the local salient regions are integrated by the core theory of Gestalt visual psychology, and the spatial saliency map is obtained. In the time domain, based on the consistency assumption of a moving object in target location, motion range and direction, the optical flow points detected by Lucas-Kanade method are classified to eliminate the noise interference, then the motion saliency of moving object is measured by the motion amplitude. Finally, based on the difference between the visual sensitivity of dynamic and static information and the difference in visual sensitivity between color information and gray information, a general fusion model of time and spatial domain salient region is proposed. The saliency detection results of single frame image and video sequence frame image are represented by the gray color model and the Munsell color system respectively. Experimental results show that the proposed saliency detection method can suppress the background noise, solve the sparse pixels problem of a moving object, and can effectively detect the salient regions from the video. The proposed fusion model can display two kinds of saliency results simultaneously in a single picture of a complex scene. This model ensures that the detection results of images are so complicated that a chaotic situation will not appear.

Improving Video Saliency Detection Via Localized Estimation and Spatiotemporal Refinement.

Video Saliency Detection Using Deep Convolutional Neural Networks.

Video Saliency Detection Via Spatial-Temporal Fusion and Low-Rank Coherency Diffusion

Saliency Flow Based Video Segmentation Via Motion Guided Contour Refinement.

Salient Region Detection by Fusing Foreground and Background Cues Extracted from Single Image

Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling.

Video Saliency Detection Using Spatiotemporal Cues

Accurate and Robust Video Saliency Detection Via Self-Paced Diffusion.

Video Saliency Map Detection Based on Global Motion Estimation.

Nonlocal Feature Transform with Iterative Refinement for Video Saliency Detection

Video Saliency Detection Via Sparsity-Based Reconstruction and Propagation.

Improved Robust Video Saliency Detection Based on Long-Term Spatial-Temporal Information

Video Saliency Detection Using Motion Saliency Filter

Spatiotemporal Saliency Based on Location Prior Model.

Video Saliency Detection Using Dynamic Fusion of Spatial-Temporal Features in Complex Background with Disturbance

Video Saliency Prediction Via Joint Discrimination and Local Consistency

Video Saliency Detection Algorithm Based on Biological Visual Feature and Visual Psychology Theory

A Deep-Learning Based Feature Hybrid Framework for Spatiotemporal Saliency Detection Inside Videos

Superpixel Level Incremental Visual Saliency Detection in Low Contrast Video

A Video Salient Object Detection Model Guided By Spatio-Temporal Prior

Improving RGBD Saliency Detection Using Progressive Region Classification and Saliency Fusion.