Abstract:Text detection and tracking in video is challenging due to contrast, resolution and background variations, and different orientations and text movements. In addition, the presence of both caption and scene texts in video aggravates the problem because these two text types differ in characteristics significantly . This paper proposes a new technique for detecting and tracking video texts of any orientation by using spatial and temporal information, respectively. The technique explores gradient directional symmetry at component level for smoothing edge components before text detection. Spatial information is preserved by forming Delaunay triangulation in a novel way at this level, which results in text candidates. Text characteristics are then proposed in a different way for eliminating false text candidates , which results in potential text candidates. Then grouping is proposed for combining potential text candidates regardless of orientation based on the nearest neighbor criterion. To tackle the problems of multi-font and multi-sized texts, we propose multi-scale integration by a pyramid structure, which helps in extracting full text lines. Then, the detected text lines are tracked in video by matching the subgraphs of triangulation. Experimental results for text detection and tracking on our video dataset, the benchmark video datasets, and the natural scene image benchmark datasets show that the proposed method is superior to the state-of-the-art methods in terms of recall, precision , and F-measure.

Effective and Efficient Video Text Extraction Using Key Text Points

A new video text detection method.

A Video Text Detection Method Based On Key Text Points

A Novel Approach to Text Detection and Extraction from Videos by Discriminative Features and Density

A Method of Effective Text Extraction for Complex Video Scene

Text Recognition in Video Using OCR

A Research on Video Text Tracking and Recognition

A multiple frame integration and mathematical morphology based technique for video text extraction

An edge-based approach for video text extraction

Text detection, localization, and tracking in compressed video

A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video

Robust Text Stroke Extraction From Video

A Fast and Effective Text Tracking in Compressed Video

Text Processing in Video Frames with Complex Background

Scene Video Text Tracking with Graph Matching

A new video text extraction approach

A Combined Algorithm for Video Text Extraction

Multi-Strategy Tracking Based Text Detection in Scene Videos

An Efficient Coarse-To-Fine Scheme For Text Detection In Videos

An Efficient Video Text Recognition System

Video Text Enhancement Using Multiple Frame Information