Research of Chinese Handwritten Text Segmentation Algorithm

Yingyong Zou, Yongde Zhang, Xinyan Cao, Guangbin Yu
DOI: https://doi.org/10.14257/ijmue.2014.9.12.12
2014-01-01
International Journal of Multimedia and Ubiquitous Engineering
Abstract: OCR is a complicated process, and many factors can influence the recognition rate. Early work tried to optimize the classifier to obtain a high recognition rate, on the premise that the input is a single character, whether printed or handwritten. Because classifier performance has improved considerably, the recognition rate for single characters is now high enough for commercial use. With growing demand for handwritten text recognition, raising the recognition rate of OCR systems has become very important. Unlike OCR systems for printed text, which focus on the classifier, research on OCR systems for handwritten text concentrates mainly on character segmentation. Statistical analysis shows that segmentation errors account for more mistakes than classifier errors. This follows from the nature of handwritten text: it is more irregular and its lines are not horizontal; in addition, handwritten Chinese characters are more likely to overlap, and the gaps between characters are smaller. These properties make the segmentation of handwritten Chinese characters difficult. This paper proposes a multi-step searching nonlinear line extraction algorithm that is simple and accurate, and that addresses some weaknesses of the direct and indirect projection methods.
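For context, the direct projection method mentioned in the abstract can be sketched as follows. This is a minimal illustration of the generic baseline technique, not the multi-step searching algorithm proposed in the paper. The function names (`row_projection`, `segment_lines`), the `min_ink` threshold, and the toy images are assumptions made for this example. The image is assumed to be binary, with 1 marking an ink pixel.

```python
from typing import List, Tuple


def row_projection(image: List[List[int]]) -> List[int]:
    """Count ink pixels (value 1) in each row of a binary image."""
    return [sum(row) for row in image]


def segment_lines(image: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Split a page into text lines with direct horizontal projection.

    Returns (start_row, end_row) pairs, end exclusive. Consecutive rows
    whose ink count is at least min_ink (a hypothetical threshold) are
    grouped into one band.
    """
    profile = row_projection(image)
    lines: List[Tuple[int, int]] = []
    start = None
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                      # a text band begins
        elif ink < min_ink and start is not None:
            lines.append((start, y))       # the band ends at a blank row
            start = None
    if start is not None:                  # band runs to the bottom edge
        lines.append((start, len(profile)))
    return lines


# Two well-separated horizontal lines: projection finds both.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

# Two slanted lines whose rows interleave: no blank row separates them,
# so direct projection merges them into a single band.
skewed = [
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
]

print(segment_lines(page))    # [(1, 3), (4, 5)]
print(segment_lines(skewed))  # [(0, 4)]
```

The second call shows the weakness the abstract alludes to: when lines are not horizontal or strokes from adjacent lines overlap, the row profile has no empty valley, and a straight horizontal cut cannot separate the lines.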