A Handwritten Chinese Characters Recognition Method Based on Sample Set Expansion and CNN

Xuchen Song,Xue Gao,Yanfang Ding,Zhixin Wang
DOI: https://doi.org/10.1109/icsai.2016.7811068
2016-01-01
Abstract:Convolutional neural networks (CNN) is a powerful technology for classification of visual inputs. However, both the scale and quality of the training set are an important factor to the performance of a learned system. In real applications, it is generally difficult to obtain a high-quality and large-scale handwritten Chinese characters sample set. Insufficient samples of handwritten Chinese characters would cause poor recognition performance. In this paper, we propose a handwritten Chinese character recognition method based on dataset expansion and CNNs. Firstly, the topology of proposed Convolutional neural networks model is addressed. Then, several dataset expansion techniques are utilized to expand the scale of available samples, which include random elastic deformation, shear transformation and rotation within a small range, etc. A series of experiments on the HCL2000 Chinese character handwriting database have shown that our method can effectively improve the recognition performance, with a reduction in error rate of 35.01%, verified the effectiveness of our proposed approach.
What problem does this paper attempt to address?