Text Detection In Born-Digital Images By Mass Estimation

Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan, Michael Blumenstein
DOI: https://doi.org/10.1109/ACPR.2015.7486591
2015-01-01
Abstract: There is a need for effective web-document understanding due to the explosive progress of Internet and network technologies. In this paper, we propose a new method for text detection in born-digital images by introducing a mass estimation concept. We explore super-pixel information from different color channels to identify text atoms in images. The proposed method uses similarity graphs and spectral clustering to identify candidate text regions. We propose mapping the Gabor responses of a candidate text region to a spatial circle to study the spatial coherency of pixels. We then introduce a mass estimation concept to identify text candidates from the pixel distribution in the spatial circle. Linear linkage graphs help group text candidates into full text lines. The same Gabor responses are used as features to eliminate false positives with an SVM classifier. We evaluate the proposed method on standard datasets, such as ICDAR 2013 (Challenge 1) and the Situ et al. dataset. Experimental results on both datasets show that the proposed method outperforms existing methods.