MHW Mongolian Offline Handwritten Dataset and Its Application

Daoerji FAN,Guanglai GAO,Huijuan WU
DOI: https://doi.org/10.3969/j.issn.1003-0077.2018.01.012
2018-01-01
Abstract:A public well-recognized Mongolian offline handwritten database is the basis for the research and develop-ment of Mongolian handwriting recognition system.Based on the research on Mongolian coding,word formation and grammar,a large-vocabulary Mongolian offline handwritten database(MHW)is constructed,which contains 100000 pieces of Mongolian words,i.e.20 samples for each of 5000 words.The test set I contains 5000 samples and test set II contains 14085 samples.An automatic error detection algorithm is applied,which is based on the vari-able length of each Mongolian word.The performance of MHW is validated on three propular handwriting recogni-tion models,among which the Recurrent Neural Network based model shows best performance of 2.20% on test set I and 5.55% on test set II with constrained dictionary.
What problem does this paper attempt to address?