Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling.

Sheng-hua Zhong, Yan Liu, Feifei Ren, Jinghuan Zhang, Tongwei Ren
DOI: https://doi.org/10.1609/aaai.v27i1.8642
2013-01-01
Abstract: The human visual system actively seeks salient regions and movements in video sequences to reduce search effort. Computational modeling of visual saliency maps provides important information for semantic understanding in many real-world applications. In this paper, we propose a novel video saliency detection model for detecting the attended regions that correspond to both interesting objects and dominant motions in video sequences. For the spatial saliency map, we inherit the classical bottom-up spatial saliency map. For the temporal saliency map, we propose a novel optical flow model based on the dynamic consistency of motion. The spatial and temporal saliency maps are constructed and then fused to create a novel attention model. The proposed attention model is evaluated on three video datasets. Empirical validation demonstrates that the salient regions detected by our dynamic consistent saliency map highlight the interesting objects effectively and efficiently. More importantly, the attended regions automatically detected by the proposed attention model are consistent with the ground truth saliency maps of eye movement data. © 2013, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
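The abstract says the spatial and temporal saliency maps are fused but does not give the fusion rule. As a minimal sketch only, the following assumes a weighted linear combination of two min-max normalized maps; the names `normalize`, `fuse`, and the weight `alpha` are hypothetical and are not taken from the paper. Plain Python lists are used to keep the example dependency-free.

```python
from typing import List

Map = List[List[float]]  # a 2-D saliency map stored as rows of floats


def normalize(m: Map) -> Map:
    """Rescale a 2-D map to [0, 1]; a constant map becomes all zeros."""
    flat = [v for row in m for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]


def fuse(spatial: Map, temporal: Map, alpha: float = 0.5) -> Map:
    """Weighted linear fusion of two same-sized maps.

    This fusion rule is an assumption for illustration, not the paper's method.
    alpha weights the spatial map; (1 - alpha) weights the temporal map.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if len(spatial) != len(temporal) or any(
        len(a) != len(b) for a, b in zip(spatial, temporal)
    ):
        raise ValueError("maps must have the same shape")
    s, t = normalize(spatial), normalize(temporal)
    return [
        [alpha * a + (1.0 - alpha) * b for a, b in zip(row_s, row_t)]
        for row_s, row_t in zip(s, t)
    ]
```

With `alpha = 0.5` both cues contribute equally; a larger `alpha` favours static appearance, a smaller one favours motion.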