An OCR post-processing method based on dictionary matching and matrix transforming

Chuyu Guo, Yuanyan Tang, Zhenchao Zhang, Bing Li, Changsong Liu
DOI: https://doi.org/10.4028/www.scientific.net/AMM.427-429.1861
2013-01-01
Applied Mechanics and Materials
Abstract: This paper describes a dictionary-based post-processing method for Chinese and Japanese character recognition. Analysis of OCR output shows that some segmentation and recognition errors violate lexical rules: the recognizer simply outputs characters whose glyphs resemble the scanned text. These errors can be handled with Fixed-Length Segmentation Matching based on a dictionary and with Glyph Code Matrix Transforming. This processing corrects most of the inaccurate recognitions, and the experimental results show that the method is an effective way to improve the recognition rate for Chinese and Japanese characters. © (2013) Trans Tech Publications, Switzerland.
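The abstract does not spell out how the fixed-length dictionary matching step works, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It assumes the recognizer returns a ranked list of candidate characters per position, slides a window of fixed length over the text, and replaces the top-ranked guess with a dictionary word whenever the candidates can spell one. The function name `correct_fixed_length`, the candidate-list input format, and the toy dictionary are all hypothetical; the glyph code matrix step is not modelled here.

```python
from itertools import product


def correct_fixed_length(candidates, dictionary, length):
    """Correct OCR output with fixed-length dictionary matching (illustrative).

    candidates: list of lists; candidates[i][0] is the recognizer's top guess
                for position i, followed by lower-ranked alternatives.
    dictionary: set of valid words, each `length` characters long.
    length:     the fixed window size (an assumed parameter).
    """
    # Start from the recognizer's top-ranked character at every position.
    output = [ranked[0] for ranked in candidates]
    i = 0
    while i + length <= len(candidates):
        window = candidates[i:i + length]
        top = "".join(ranked[0] for ranked in window)
        if top in dictionary:
            # The top guesses already form a word; keep them and move on.
            i += length
            continue
        # Try candidate combinations in rank order until one is a word.
        match = next(
            (word for word in ("".join(c) for c in product(*window))
             if word in dictionary),
            None,
        )
        if match is not None:
            output[i:i + length] = list(match)
            i += length
        else:
            # No dictionary word fits this window; slide by one position.
            i += 1
    return "".join(output)


# Hypothetical example: the recognizer confuses the similar glyphs 末 and 未.
ocr_candidates = [["末", "未"], ["来", "米"], ["は"]]
print(correct_fixed_length(ocr_candidates, {"未来"}, 2))
```

Enumerating every candidate combination is exponential in the window length, which is tolerable only because the window is short and fixed; a real system would likely index the dictionary (for example with a trie) instead.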