Text Extraction from Mail Images with Complex Background.

Qingqing Wang,Xiao Tu,Shujing Lu,Yue Lu
DOI: https://doi.org/10.1007/978-981-10-8108-8_1
2017-01-01
Abstract:A novel method is proposed for text extraction from mail images with complex background. Firstly, wavelet transform and Laplacian operator are applied to generate the features of regions which are obtained by dividing input image with sliding window. Then, support vector machine (SVM) is utilized to classify these regions into texts and non-texts according to the features. Bootstrap strategy is used to build the training database. Finally, connected components analysis (CCA) is employed to merge text regions into text candidates which can be processed by following steps to get the delivery address. Experimental results involving 534 mail images show the effectiveness and robustness of the proposed method, and comparison results with other methods demonstrate the advantages of the selected features.
What problem does this paper attempt to address?