CNN-Based Hindi Numeral String Recognition for Indian Postal Automation.

Hongjian Zhan, Shujing Lyu, Umapada Pal, Yue Lu
DOI: https://doi.org/10.1109/icdarw.2019.40085
2019-01-01
Abstract: Digits in the pin code of handwritten Indian postal documents may touch each other, which makes digit string recognition a very challenging task. In this paper, we propose a digit string recognition system for Indian postal documents written in Hindi. Unlike a normal text string, a string of digits carries no contextual information, because any digit may be followed by an arbitrary digit. For this reason, we propose a new architecture for Hindi numeral string recognition that is based on a CNN (Convolutional Neural Network) and CTC (Connectionist Temporal Classification), without using an RNN. To connect the CNN with CTC, we transform the outputs of the CNN into a two-dimensional vector to meet the input requirement of CTC. Furthermore, we use dense blocks to build the CNN part so that it extracts efficient image features. Comparative studies with state-of-the-art methods show that the proposed method outperforms existing methods on Hindi numeral string recognition.
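The two ideas in the abstract that lend themselves to a small illustration are the reshaping of CNN outputs into a two-dimensional, frame-by-frame form and the way CTC output is read back as a digit string. The sketch below is not the authors' implementation: it contains no neural network and no training loss, and the names `feature_map_to_sequence` and `ctc_greedy_decode`, the column-wise flattening order, and the choice of index 0 for the CTC blank label are all assumptions made for illustration. It only shows, in plain Python, one plausible way to turn a (channels, height, width) feature map into one feature vector per image column, and how best-path CTC decoding merges repeated labels and drops blanks.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank label (not specified in the abstract)


def feature_map_to_sequence(feature_map):
    """Flatten a (channels, height, width) nested list into a
    (width, channels * height) sequence: one feature vector per image column.

    The column-wise ordering is an assumption; the abstract only says the
    CNN output is transformed into a two-dimensional form for CTC.
    """
    channels = len(feature_map)
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    return [
        [feature_map[c][h][w] for c in range(channels) for h in range(height)]
        for w in range(width)
    ]


def ctc_greedy_decode(frame_scores, blank=BLANK):
    """Best-path CTC decoding: pick the highest-scoring label per frame,
    merge consecutive repeats, then remove blanks."""
    best_path = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    collapsed = [label for label, _ in groupby(best_path)]
    return [label for label in collapsed if label != blank]


if __name__ == "__main__":
    # Toy feature map: 2 channels, 2 rows, 3 columns.
    toy_map = [
        [[1, 2, 3], [4, 5, 6]],
        [[7, 8, 9], [10, 11, 12]],
    ]
    print(feature_map_to_sequence(toy_map))

    # Toy per-frame scores over 3 classes: blank, label 1, label 2.
    toy_scores = [
        [0.1, 0.8, 0.1],
        [0.1, 0.8, 0.1],
        [0.9, 0.05, 0.05],
        [0.1, 0.7, 0.2],
        [0.1, 0.2, 0.7],
        [0.8, 0.1, 0.1],
    ]
    print(ctc_greedy_decode(toy_scores))
```

In the second toy example the blank frame between the two runs of label 1 is what allows the same label to appear twice in a row in the decoded string, which matters for pin codes that contain repeated digits.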