A Japanese OCR Post-Processing Approach Based on Dictionary Matching

Chu-Yu Guo, Yuan-Yan Tang, Chang-Song Liu, Jia Duan
DOI: https://doi.org/10.1109/icwapr.2013.6599286
2013-01-01
Abstract: This paper describes a dictionary-based post-processing approach for Japanese character recognition. Analysis of experimental OCR data shows that some segmentation and recognition results do not conform to lexical rules, because the recognizer generates characters from shape alone. When the characters to be recognized are visually similar to other characters, OCR errors occur easily. To correct these errors, we propose a method based on Limited Length Segmentation Matching and a Bayesian statistical classifier. This method corrects most of the recognition errors caused by similar character shapes. Experimental results show that it is an effective way to improve the recognition rate for Japanese characters.
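The abstract names the two components but does not define them, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that Limited Length Segmentation Matching behaves like forward maximum matching with a bounded window, and that the Bayesian step picks the dictionary word maximizing a word prior multiplied by per-character OCR candidate scores. The function names, the `max_len` parameter, the toy lexicon, and all probabilities are hypothetical.

```python
from typing import Dict, List, Optional, Set, Tuple


def segment_limited(text: str, lexicon: Set[str], max_len: int = 3) -> List[str]:
    """Forward maximum matching with a window of at most max_len characters.

    Assumed reading of "limited length segmentation matching"; the abstract
    does not spell out the procedure.
    """
    pieces: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest allowed window first, then shrink it.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or piece in lexicon:
                pieces.append(piece)
                i += size
                break
    return pieces


def correct_word(
    candidates: List[Dict[str, float]],
    word_prior: Dict[str, float],
) -> Tuple[Optional[str], float]:
    """Pick the dictionary word with the highest prior * likelihood score.

    candidates[i] maps each OCR candidate character at position i to its
    (hypothetical) recognition score. Returns (None, 0.0) if nothing matches.
    """
    best: Optional[str] = None
    best_score = 0.0
    for word, prior in word_prior.items():
        if len(word) != len(candidates):
            continue
        score = prior
        for ch, cand in zip(word, candidates):
            # A character the recognizer never proposed contributes zero.
            score *= cand.get(ch, 0.0)
        if score > best_score:
            best, best_score = word, score
    return best, best_score


# Toy data, invented for illustration only.
LEXICON = {"日本", "日本語", "文字", "認識"}
segments = segment_limited("日本語文字認識", LEXICON, max_len=3)

# The recognizer slightly prefers the look-alike "曰" over "日".
ocr_candidates = [{"曰": 0.55, "日": 0.45}, {"本": 0.9, "木": 0.1}]
fixed, score = correct_word(ocr_candidates, {"日本": 0.6, "文字": 0.4})
```

In this toy case the shape-based first choice would read "曰本", which is not in the lexicon, while the dictionary prior steers the result to "日本". A real system would need smoothed probabilities and a much larger lexicon.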