A Framework for Handwritten Date Recognition in Quality Documents

Haoran Tang, Jian Zhang, Kaihong Yan, Gaoang Wang, Hongwei Wang
DOI: https://doi.org/10.1109/icebe55470.2022.00048
2022-01-01
Abstract: Document text recognition is an important task in optical character recognition. Because real handwriting samples are difficult to collect and handwritten characters vary widely, recognizing handwritten text in quality documents remains challenging. In this paper, we propose a framework for handwritten date recognition in nuclear power quality documents, together with a pre-training data construction method to improve recognition performance. The framework consists of an image feature extraction module and a text transcription module. The text transcription module combines the text transcription functions of the Convolutional Recurrent Neural Network (CRNN) and Show-Attend-and-Read (SAR). Experiments are conducted on handwritten date recognition in nuclear power quality documents. The results show that the framework clearly improves the recognition metrics and that the pre-training dataset helps speed up training.
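As an illustration of the CRNN-style transcription mentioned in the abstract, the following minimal sketch shows greedy CTC decoding, the standard way a CRNN turns per-frame class scores into a string. It is not the authors' implementation: the abstract does not say how the CRNN and SAR transcription functions are combined, and the alphabet, the blank index, and the names `ctc_greedy_decode` and `frames_from_path` are assumptions made for this example.

```python
from typing import List, Sequence

# Hypothetical alphabet for date strings (an assumption, not from the paper).
# Index 0 is reserved for the CTC blank symbol.
BLANK = 0
ALPHABET = ["<blank>"] + list("0123456789") + ["-", "/", "."]


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]]) -> str:
    """Take the best class per frame, merge consecutive repeats, drop blanks."""
    best_path = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    decoded: List[str] = []
    previous = BLANK
    for index in best_path:
        # Emit a character only when it is not blank and differs from the
        # previous frame's class, so repeated frames collapse to one symbol.
        if index != BLANK and index != previous:
            decoded.append(ALPHABET[index])
        previous = index
    return "".join(decoded)


def frames_from_path(path: Sequence[int]) -> List[List[float]]:
    """Build one-hot frame scores from a class-index path (for demonstration)."""
    return [[1.0 if i == index else 0.0 for i in range(len(ALPHABET))] for index in path]


# Class indices: '2' -> 3, '0' -> 1, blank -> 0.
# The blank between the last two '2' frames keeps them as separate characters.
example = frames_from_path([3, 3, BLANK, 1, BLANK, 3, BLANK, 3])
print(ctc_greedy_decode(example))
```

An attention-based decoder such as the one in SAR instead predicts characters one at a time from attended image features, so it does not need the blank symbol or the repeat-merging step shown here.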