Multi-font Multi-Size Printed Uyghur Character Recognition

王华,丁晓青,哈力木拉提
DOI: https://doi.org/10.3321/j.issn:1000-0054.2004.07.022
2004-01-01
Abstract:A Uyghur optical character recognition method was developed for multi-font multi-size printed Uyghur characters. Initially, pre-classification information is used to divide the entire character set into several subsets with two strategies employed to recognize a character. The character is first digitized on two meshes, 32×32 points and 24×24 points with the directional line element features then extracted from the two meshes. After dimensional reduction, the feature vectors are then classified using a modified quadratic discriminant function (MQDF). The recognition results produced by the two recognition strategies are integrated based on an overall confidence value. Finally, the local and structural features are selected to discriminate between similar characters. The recognition accuracy on a test set containing 48 800 characters reached 99.48%.
What problem does this paper attempt to address?