SCENE TEXT EXTRACTION METHOD BASED ON HIERARCHICAL BLOCK FILTERING AND STROKE FEATURES

Bai Hongfei, Jin Cheng
DOI: https://doi.org/10.3969/j.issn.1000-386X.2010.05.019
2010-01-01
Abstract: Scene text carries important semantic information about scene images, so extracting it supports content analysis, browsing, and retrieval of those images. The scene text extraction method proposed in this paper works in two stages. First, a hierarchical block filtering method based on edge detection filters out the background at different scales to generate aggregated scene text regions. Second, the aggregated text regions are binarized according to stroke features, namely the colour and width of the strokes, to produce binarized text images. These binarized text region images can then be used as input to an OCR engine, which achieves the goal of extracting the semantic information of the scene images. The hierarchical block filtering method filters complex backgrounds well when generating aggregated text regions, and the stroke features allow text stroke pixels to be segmented effectively. Experimental results also demonstrate that the method is effective.
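The first stage described in the abstract, hierarchical block filtering over an edge map, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract names no edge detector, block sizes, or thresholds, so the finite-difference detector, the block sizes (32, 16, 8), the edge-density criterion, the values 40.0 and 0.05, and the function names edge_map and hierarchical_block_filter are all assumptions made for illustration. The stroke-based binarization stage is not covered.

```python
import numpy as np


def edge_map(gray, thresh=40.0):
    """Binary edge map from central-difference gradients.

    Assumption: the abstract does not name an edge detector; this simple
    gradient-magnitude threshold stands in for whichever one is used.
    """
    g = gray.astype(np.float64)
    gx = np.zeros_like(g)
    gy = np.zeros_like(g)
    gx[:, 1:-1] = g[:, 2:] - g[:, :-2]   # horizontal gradient
    gy[1:-1, :] = g[2:, :] - g[:-2, :]   # vertical gradient
    return np.hypot(gx, gy) > thresh


def hierarchical_block_filter(gray, block_sizes=(32, 16, 8), min_density=0.05):
    """Return a boolean mask of pixels kept as candidate text regions.

    Blocks are tested from coarse to fine. A block whose fraction of edge
    pixels is below min_density is treated as background and dropped.
    Finer scales only re-examine blocks that survived the coarser ones.
    Block sizes and the density threshold are hypothetical values.
    """
    edges = edge_map(gray)
    height, width = gray.shape
    keep = np.ones((height, width), dtype=bool)
    for b in block_sizes:
        for y in range(0, height, b):
            for x in range(0, width, b):
                if not keep[y:y + b, x:x + b].any():
                    continue  # already rejected at a coarser scale
                if edges[y:y + b, x:x + b].mean() < min_density:
                    keep[y:y + b, x:x + b] = False
    return keep
```

Calling hierarchical_block_filter on a 2-D greyscale array returns a mask of the same shape; the connected True regions play the role of the aggregated text regions that the second stage would binarize. Testing coarse blocks first discards large background areas cheaply, and the finer scales then tighten the boundary around edge-dense areas.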