Text detection of two major Indian scripts in natural scene images

Aruni Roy Chowdhury, Ujjwal Bhattacharya, Swapan K Parui
2012-01-01
Abstract:In this article, we present a robust scheme for detection of Devanagari or Bangla texts in scene images. These are the two most popular scripts in India. The proposed scheme is primarily based on two major characteristics of such texts - (i) variations in stroke thickness for text components of a script are low compared to their non-text counterparts and (ii) presence of a headline along with a few vertical downward strokes originating from this headline. We use the Euclidean distance transform to verify the general characteristics of texts in (i). Also, we apply the probabilistic Hough line transform to detect the characteristic headline of Devanagari and Bangla texts. Further, similarity and adjacency measures are applied to identify text regions, which do not satisfy the verification in (ii). The proposed approach has been simulated on a repository of 120 images taken from Indian roads and the results are …
What problem does this paper attempt to address?