Abstract:In this paper, a novel multiscale visual saliency detection algorithm combining spatiotemporal features and ant colony optimization is proposed. In the method, both the spatial information, such as luminance, chrominance and texture, and the temporal information, namely motion, are used to fulfill a better prediction of visual saliency. Besides, the information we use in the method are all extracted directly from the compressed video bitstreams to avoid the time-consuming decompressing process. The concept of multiscale is introduced. We use graphs of different scales constructed by dividing the video frames into blocks of different sizes to achieve more human-eye adaptability. Then the spatial features, namely luminance, chrominance and texture, are extracted directly from discrete cosine transform coefficients while the temporal information are extracted from the motion vectors to form the heuristic matrices. Next, the heuristic matrixes are used as part of the ant colony optimization process. Each heuristic matrix is used to steer the ants in the algorithm and the ants deposit pheromone on the graph. The pheromone is updated through attenuation and evaporation thus forming spatial/temporal saliency maps. Finally, the spatial and temporal saliency maps of each scale are fused together through adaptive fusion, and maps of different scales are fused through linear fusion. Since the model is constructed using information in compressed domain individually, the decompression process is avoided to save more time and to be suitable for videos transmitted on the network. Besides, the proposed method has been extensively tested on several video databases with sequences in various scenes. Through experiments it can be seen that in both quantitative evaluation scores and intuitive visual effects, the algorithm in this paper exhibits a better performance compared to the contrast methods in this paper.

Video Saliency Detection Incorporating Temporal Information in Compressed Domain

Visual saliency detection based on mutual information in compressed domain

Visual Saliency Detection Algorithm in Compressed HEVC Domain

A Video Saliency Detection Model in Compressed Domain

Video saliency detection in the compressed domain.

Video Saliency Detection Using Motion Saliency Filter

Motion Saliency Detection for Compressed Videos.

Video Saliency Detection Based on Mutual Information and Background Prior in Compressed Domain

Video Saliency from Compressed Domain Coding Length

Adaptive temporal compressive sensing for video with motion estimation

A Novel Video Saliency Map Detection Model in Compressed Domain.

Saliency Detection with Features from Compressed HEVC

Learning to Detect Video Saliency With HEVC Features.

A Fast and Efficient Saliency Detection Model in Video Compressed-Domain for Human Fixations Prediction

Video Saliency Map Detection Based on Global Motion Estimation.

Robust moving object segmentation in the compressed domain for H.264/AVC video stream

A Multiscale Compressed Video Saliency Detection Model Based on Ant Colony Optimization.

Saliency Detection in the Compressed Domain for Adaptive Image Retargeting

Visual Attention Based Video Object Segmentation in MPEG Compressed Domain

Video Saliency Detection Algorithm Based on 3D Transform Domain Spectral Difference

Video Saliency Detection Via Spatial-Temporal Fusion and Low-Rank Coherency Diffusion