A New Method For Spatiotemporal Textual Saliency Detection In Video

Susu Shan, Hailiang Xu, Feng Su
DOI: https://doi.org/10.1109/ICPR.2016.7900134
2016-01-01
Abstract: Detecting salient image regions that contain textual patterns in video is valuable to many content-based video applications, such as video retrieval, abstraction, classification, and analysis. In this paper, we present an effective textual saliency detection method for natural scene videos. We first compute text-like confidence values for local image regions using an efficient cascaded prediction model; these values capture the basic visual cues of textual components in the video frames. Next, we construct patch features that describe the statistical and spatial distribution of the confidence values and combine them with general visual features such as color. We then employ a saliency detection model based on random walk with restart on a graph of local video regions, which effectively integrates the spatial and temporal saliency maps. Experimental results demonstrate the effectiveness of the proposed method.
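The random-walk-with-restart step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the graph construction, edge weights, or restart distribution, so everything below is an assumption made for illustration: the function name `rwr_saliency`, the Gaussian edge weights on region features, the restart probability `c`, and the use of a prior saliency vector as the restart distribution are all hypothetical and may differ from the authors' formulation.

```python
import numpy as np

def rwr_saliency(features, restart, c=0.15, sigma=1.0):
    """Random walk with restart on a fully connected graph of regions.

    features: (n, d) array, one feature vector per region (assumed input).
    restart:  (n,) non-negative prior saliency used as restart distribution
              (an assumption; the abstract does not specify this choice).
    c:        restart probability (hypothetical default).
    Returns the stationary distribution, one saliency score per region.
    """
    features = np.asarray(features, dtype=float)
    restart = np.asarray(restart, dtype=float)

    # Gaussian affinity between every pair of regions (assumed edge weight).
    diff = features[:, None, :] - features[None, :, :]
    dist2 = np.sum(diff ** 2, axis=2)
    W = np.exp(-dist2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)  # no self-loops

    # Column-stochastic transition matrix: P[i, j] = Pr(move from j to i).
    P = W / W.sum(axis=0, keepdims=True)

    # Normalise the restart vector into a probability distribution.
    r = restart / restart.sum()

    # The stationary distribution satisfies pi = (1 - c) P pi + c r,
    # i.e. (I - (1 - c) P) pi = c r, which is solved directly here.
    n = r.shape[0]
    pi = np.linalg.solve(np.eye(n) - (1.0 - c) * P, c * r)
    return pi
```

In this sketch, regions that are both favoured by the restart prior and strongly connected to other favoured regions receive higher scores. A direct linear solve is used because the example graph is small; for many regions, power iteration on the same update would be the usual alternative.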