Nonlocal Feature Transform with Iterative Refinement for Video Saliency Detection

Kai Tian,Zongqing Lu,Qingmin Liao
DOI: https://doi.org/10.1145/3194206.3194232
2018-01-01
Abstract:This paper proposes an effective approach to detect video saliency in unconstrained videos. Our approach explores mutual information among nonlocal frames, and feature differences between salient regions and backgrounds. First, motion saliency is computed by superpixel-level motion distinction from frame-level's. Second, we extract definite foreground and background samples. Each sample is described by high dimensional features. In human perception, salient regions have distinctive features from backgrounds such as color and texture. We attempt to learn optimal linear coefficients to separate the salient regions and backgrounds based on these samples. Finally, image filtering combined with generalized boundary is performed to refine the saliency map. Our approach yields high accurate saliency maps with well-defined boundaries. Quantitative and qualitative experiments are carried out on three benchmark video datasets, which show that our approach achieves state-of-the-art performance in video saliency detection.
What problem does this paper attempt to address?