End-To-End Chinese Text Recognition

Jie Hu, Tszhang Guo, Ji Cao, Changshui Zhang
DOI: https://doi.org/10.1109/GlobalSIP.2017.8309193
2017-01-01
Abstract: In this paper, we propose a new method for Chinese text recognition, which comprises two main contributions. First, we create a large Chinese text dataset, including 260 thousand images collected from business cards and 390 thousand synthetic images generated by a rendering engine. Second, we use these images to train a deep network to perform text recognition, which can accurately recognize more than six thousand distinct Chinese characters. Although our system is composed of different types of neural networks (CNN and LSTM), it is end-to-end trainable. Experiments demonstrate that our system achieves high recognition accuracy and that the synthetic data plays an important role in this result.
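To make the dataset proportions in the abstract concrete, the short sketch below computes the total size and the real/synthetic split from the two stated image counts. The function name `dataset_composition` is hypothetical and used only for illustration; the abstract does not say how the two sources are mixed or sampled during training, so this is plain arithmetic on the reported figures, not a description of the authors' pipeline.

```python
# Image counts as stated in the abstract.
REAL_IMAGES = 260_000       # collected from business cards
SYNTHETIC_IMAGES = 390_000  # generated by a rendering engine


def dataset_composition(real: int, synthetic: int) -> dict:
    """Return the total image count and the share of each source.

    Hypothetical helper for illustration; not taken from the paper.
    """
    total = real + synthetic
    return {
        "total": total,
        "real_fraction": real / total,
        "synthetic_fraction": synthetic / total,
    }


composition = dataset_composition(REAL_IMAGES, SYNTHETIC_IMAGES)
print(composition)
```

Under these figures the dataset holds 650 thousand images in total, of which synthetic images make up 60 percent.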