Adversarial Feature Enhancing Network for End-to-End Handwritten Paragraph Recognition

Yaoxiong Huang,Zecheng Xie,Lianwen Jin,Yuanzhi Zhu,Shuaitao Zhang
DOI: https://doi.org/10.1109/ICDAR.2019.00073
2019-01-01
Abstract:To date, offline handwriting paragraph recognition systems either separately crop text line images and recognize them or perform implicit line segmentation by integrating complicated multi-dimensional long short-term memory (MDLSTM) with an attention mechanism. The former abovementioned approachs could lead to sub-optimal performances while the latter is very time-consuming. In this paper, a fast end-to-end system, called adversarial feature enhancing network (AFEN), is proposed for offline handwritten paragraph recognition. The proposed AFEN system comprises five components: a shared feature extractor for robust feature learning, a text detection branch for text box proposal; RoIRotate for oriented feature region extraction, an adversarial feature learning network for joint feature learning of text detection and recognition branch, and a text recognition branch for text transcription. Experiments on two popular handwritten paragraph recognition benchmarks, namely IAM and Rimes are used to verify the efficacy of the proposed AFEN system. The proposed approach yields impressive results compared to previously proposed systems in the literature.
What problem does this paper attempt to address?