Abstract:Text detection and tracking in video is challenging due to contrast, resolution and background variations, and different orientations and text movements. In addition, the presence of both caption and scene texts in video aggravates the problem because these two text types differ in characteristics significantly . This paper proposes a new technique for detecting and tracking video texts of any orientation by using spatial and temporal information, respectively. The technique explores gradient directional symmetry at component level for smoothing edge components before text detection. Spatial information is preserved by forming Delaunay triangulation in a novel way at this level, which results in text candidates. Text characteristics are then proposed in a different way for eliminating false text candidates , which results in potential text candidates. Then grouping is proposed for combining potential text candidates regardless of orientation based on the nearest neighbor criterion. To tackle the problems of multi-font and multi-sized texts, we propose multi-scale integration by a pyramid structure, which helps in extracting full text lines. Then, the detected text lines are tracked in video by matching the subgraphs of triangulation. Experimental results for text detection and tracking on our video dataset, the benchmark video datasets, and the natural scene image benchmark datasets show that the proposed method is superior to the state-of-the-art methods in terms of recall, precision , and F-measure.

A New Method For Spatiotemporal Textual Saliency Detection In Video

A new video text detection method.

Video Saliency Detection Using Motion Saliency Filter

Predictive Video Saliency Detection.

Color and motion information fusion based saliency filter for video

Video Identification Using Spatio-temporal Salient Points

Video saliency detection based on robust seeds generation and spatio-temporal propagation

Motion-Aware Rapid Video Saliency Detection

Video Saliency Detection Algorithm Based on Motion Spectral Residual

Video-based Salient Object Detection Via Spatio-Temporal Difference and Coherence

Video Saliency Detection Using Dynamic Fusion of Spatial-Temporal Features in Complex Background with Disturbance

Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling.

Video Saliency Detection Using Multi-Level Spatiotemporal Orientation.

Spatio-temporal salience based video quality assessment

A Dataset and Evaluation Methodology for Visual Saliency in Video

End-to-End Video Saliency Detection Via a Deep Contextual Spatiotemporal Network

Saliency Detection in Face Videos: A Data-Driven Approach.

Graph-Theoretic Spatiotemporal Context Modeling for Video Saliency Detection

A novel visual saliency detection method for infrared video sequences

Fast Video Saliency Detection Via Maximally Stable Region Motion and Object Repeatability

A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video