Automatic Detection and Localization of Natural Scene Text in Video

Xiaodong Huang, Huadong Ma
DOI: https://doi.org/10.1109/ICPR.2010.786
IF: 8
2010-01-01
Pattern Recognition
Abstract: Video scene text carries semantic information and can therefore contribute significantly to video indexing and summarization. However, most previous approaches to detecting scene text in video have difficulty handling text with varying character sizes and alignments. In this paper, we propose a novel algorithm for scene text detection and localization in video. Based on our observation that character strokes show dense edge detail in fixed orientations regardless of text alignment and size, a stroke map is first generated. For scene text detection, we extract texture features from the stroke map to locate text lines. The detected text lines are then accurately localized using Harris corners in the stroke map. Experimental results show that this approach is robust and can be effectively applied to scene text detection and localization in video.
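The abstract does not give implementation details for the localization step, so the following is only a minimal sketch of the standard Harris corner response, not the authors' implementation. It assumes a 2-D array standing in for a stroke map; the function names `box_filter` and `harris_response`, the window radius, and the constant `k = 0.04` (a commonly used value) are illustrative assumptions rather than values taken from the paper.

```python
import numpy as np


def box_filter(a, radius=1):
    """Sum values over a (2*radius+1) x (2*radius+1) window, zero-padded."""
    size = 2 * radius + 1
    padded = np.pad(a, radius, mode="constant")
    out = np.zeros(a.shape, dtype=float)
    for dy in range(size):
        for dx in range(size):
            out += padded[dy:dy + a.shape[0], dx:dx + a.shape[1]]
    return out


def harris_response(img, k=0.04, radius=1):
    """Standard Harris response R = det(M) - k * trace(M)^2 per pixel.

    `img` is a hypothetical stand-in for a stroke map; `k` and `radius`
    are assumed values, not parameters reported in the paper.
    """
    img = np.asarray(img, dtype=float)
    gy, gx = np.gradient(img)          # central-difference gradients
    sxx = box_filter(gx * gx, radius)  # windowed structure-tensor entries
    syy = box_filter(gy * gy, radius)
    sxy = box_filter(gx * gy, radius)
    det = sxx * syy - sxy * sxy
    trace = sxx + syy
    return det - k * trace ** 2


# Synthetic example: a bright square on a dark background.
stroke_map = np.zeros((20, 20))
stroke_map[5:15, 5:15] = 1.0
response = harris_response(stroke_map)
# Positive response at a square corner, negative along a straight edge.
print(response[5, 5] > 0, response[10, 5] < 0)
```

On this synthetic input the response is positive at the corners of the square, negative along its straight edges, and zero in flat regions, which is the property that makes corner points useful for tightening the boundaries of a detected text line.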