Abstract:Offline Chinese handwritten character string recognition is one of the most important research fields in pattern recognition. Due to the free writing style, large variability in character shapes and different geometric characteristics, Chinese handwritten character string recognition is a challenging problem to deal with. However, among the current methods over-segmentation and merging method which integrates geometric information, character recognition information and contextual information, shows a promising result. It is found experimentally that a large part of errors are segmentation error and mainly occur around non-Chinese characters. In a Chinese character string, there are not only wide characters namely Chinese characters, but also narrow characters like digits and letters of the alphabet. The segmentation error is mainly caused by uniform geometric model imposed on all segmented candidate characters. To solve this problem, post processing is employed to improve recognition accuracy of narrow characters. On one hand, multi-geometric models are established for wide characters and narrow characters respectively. Under multi-geometric models narrow characters are not prone to be merged. On the other hand, top rank recognition results of candidate paths are integrated to boost final recognition of narrow characters. The post processing method is investigated on two datasets, in total 1405 handwritten address strings. The wide character recognition accuracy has been improved lightly and narrow character recognition accuracy has been increased up by 10.41% and 10.03% respectively. It indicates that the post processing method is effective to improve recognition accuracy of narrow characters.

Influence of Language Models and Candidate Set Size on Contextual Post-processing for Chinese Script Recognition.

A hybrid post-processing system for offline handwritten Chinese script recognition

Combining character-based bigrams with word-based bigrams in contextual postprocessing for Chinese script recognition.

A Word Language Model Based Contextual Language Processing On Chinese Character Recognition

Contextual Post-Processing Based on the Confusion Matrix in Offline Handwritten Chinese Script Recognition

Multiple candidate characters in the post-processing for off-line handwritten Chinese character recognition

Off- Line Chinese Writer Identification Based on Character-Level Decision Combination

Post Processing for Offline Chinese Handwritten Character String Recognition

Distant BI-Gram Model, Collocation, and Their Applications in Post-Processing for Chinese Character Recognition

A Hybrid Post-Processing System For Offline Handwritten Chinese Character Recognition Based On A Statistical Language Model

Post-Processing Approach for Printed Chinese Character Recognition

New Post-Processing Method Based on Noisy Channel Model for Chinese Character Recognition

An Adaptive Post-processing Method using Proofreading Information for Chinese Character Recognition

A Post-processing Approach for Handwritten Chinese Address Recognition

Context driven chinese string segmentation and recognition

Effects of prosodic patterns and the morpheme position probability on word segmentation and recognition in overlapping ambiguous strings by learners of Chinese

An Efficient Post-Processing Approach for Off-Line Handwritten Chinese Address Recognition

Rethinking orthographic neighbor in Chinese two-character word recognition: Insights from a megastudy

A Hybrid Post-Processing System for Handwritten Chinese Character Recognition

The Impact of Visual Information in Chinese Characters: Evaluating Large Models' Ability to Recognize and Utilize Radicals

A Chinese OCR Spelling Check Approach Based on Statistical Language Models.