Edge-Pixels Clustering for Text Area Extraction

Hui Fu,Xiabi Liu,Yunde Jia
DOI: https://doi.org/10.3321/j.issn:1003-9775.2006.05.019
2006-01-01
Abstract:An approach based on edge-pixels clustering to extract Chinese and English text areas from an image is proposed. The image is segmented into pixel-subclasses based on the colors and positions of edge-pixels. And then the initial text areas are extracted according to the features of edges in text area. The boundaries of the initial text areas are expanded for the entire text areas. Furthermore, an algorithm of text area binarization is presented to improve the efficiency of post-processing by reducing the number of binary images when the text color polarity is unknown. The experimental results show that the proposed approach is effective with integrality up to 99%.
What problem does this paper attempt to address?