Improving HMM-Based Chinese Handwriting Recognition Using Delta Features and Synthesized String Samples.

Tong-Hua Su, Cheng-Lin Liu
DOI: https://doi.org/10.1109/icfhr.2010.18
2010-01-01
Abstract: The HMM-based segmentation-free strategy for Chinese handwriting recognition has the advantage of training without annotation of character boundaries. However, recognition performance has been limited by the small number of string samples. In this paper, we explore two techniques to improve performance. First, Delta features are added to the static ones to alleviate the conditional independence assumption of HMMs. Second, we investigate techniques for synthesizing string samples from isolated character images. We show that synthesizing linguistically natural string samples makes insufficient use of the isolated samples. Instead, we draw character samples without replacement and concatenate them into string images, inserting between-character gaps. Our experimental results demonstrate that both Delta features and synthesized string samples significantly improve recognition performance. Combined with a bigram language model, they increase the recognition accuracy by 36-38% compared to our previous system.
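
The two techniques named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas or parameters, so everything here is an assumption: the delta computation uses the regression formula that is common in speech recognition, and the function names, the window size, the gap range, and the zero-valued background are hypothetical choices, not the authors' settings.

```python
import numpy as np


def delta_features(static, window=2):
    """Append regression-based delta features to static frame features.

    static: array of shape (T, D), one row per frame.
    Returns an array of shape (T, 2 * D): static features, then deltas.
    The regression formula and window size are assumptions, not taken
    from the paper.
    """
    static = np.asarray(static, dtype=float)
    num_frames = static.shape[0]
    # Repeat the edge frames so every frame has `window` neighbors per side.
    padded = np.pad(static, ((window, window), (0, 0)), mode="edge")
    denom = 2.0 * sum(k * k for k in range(1, window + 1))
    delta = np.zeros_like(static)
    for k in range(1, window + 1):
        ahead = padded[window + k : window + k + num_frames]
        behind = padded[window - k : window - k + num_frames]
        delta += k * (ahead - behind)
    return np.hstack([static, delta / denom])


def synthesize_strings(char_images, chars_per_string, gap_range=(2, 8), seed=0):
    """Build string images from isolated character images.

    Characters are drawn without replacement (one shuffled pass over the
    pool) and concatenated left to right with a random blank gap between
    neighbors. Returns a list of (string_image, character_indices) pairs.
    Gap range and background value 0 are hypothetical.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(char_images))  # each sample used at most once
    strings = []
    last_start = len(order) - chars_per_string
    for start in range(0, last_start + 1, chars_per_string):
        idx = order[start : start + chars_per_string]
        height = max(char_images[i].shape[0] for i in idx)
        pieces = []
        for n, i in enumerate(idx):
            img = char_images[i]
            # Pad shorter characters at the bottom to a common height.
            pieces.append(np.pad(img, ((0, height - img.shape[0]), (0, 0))))
            if n < len(idx) - 1:
                gap = int(rng.integers(gap_range[0], gap_range[1] + 1))
                pieces.append(np.zeros((height, gap), dtype=img.dtype))
        strings.append((np.hstack(pieces), [int(i) for i in idx]))
    return strings
```

Drawing from a single shuffled permutation is what makes the sampling "without replacement": every isolated sample appears in at most one synthesized string, so the whole pool is used rather than only the characters that happen to occur in natural text.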