Abstract:Mongolian is one of the most common written languages in China, Mongolia, and Russia. Many printed Mongolian documents still remain to be digitized for digital library applications. The traditional Mongolian script has a unique vertical cursive writing style and multiple font variations, which makes Mongolian Optical Character Recognition challenging. As the traditional Mongolian script has subcomponent characteristics, such that one character may be a constituent of another character, in this work we define a novel character set for recognition using segmented components. The components are combined into characters in a rule-based post-processing module. For overall character recognition, a method based on Visual Directional Features and multi-level classifiers is presented. For character segmentation, segmentation points are identified by analyzing the properties of projection profiles and connected components. Mongolian has dozens of different printed font types that can be categorized into two major groups, namely, standard and handwritten-style groups. The segmentation parameters are adjusted for each group. Additionally, script identification and relevant character recognition kernels are integrated for the recognition of Mongolian text mixed with Chinese and English. A novel multi-font printed Mongolian document recognition system based on the proposed methods is implemented. Experiments indicate a text recognition rate of 96.9% on the test samples from real documents with multiple font types and mixed script. The proposed methods can also be applied to other scripts in the Mongolian script family, such as Todo and Sibe, with significant potential for extension to historic Mongolian documents.

End-to-End Model Based on Bidirectional LSTM and CTC for Segmentation-free Traditional Mongolian Recognition

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning.

Chinese Image Text Recognition with BLSTM-CTC: A Segmentation-Free Method.

Multi-font Printed Mongolian Document Recognition System

Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion

An End-to-End, Segmentation-Free, Arabic Handwritten Recognition Model on KHATT

Segmentation and Recognition for Historical Tibetan Document Images

TAMS: Translation-Assisted Morphological Segmentation

LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition

Bidirectional LSTM-CRF Attention-based Model for Chinese Word Segmentation

See-Lpr: A Semantic Segmentation Based End-To-End System For Unconstrained License Plate Detection And Recognition

Uyghur Character Models with Shared Structure Information for Segmentation-free Recognition under Low Data Resource Conditions

State-of-the-art Chinese Word Segmentation with Bi-LSTMs

Bi-directional LSTM Recurrent Neural Network for Chinese Word Segmentation

Research on the LSTM Mongolian and Chinese machine translation based on morpheme encoding

An approach for handwritten Chinese text recognition unifying character segmentation and recognition

Offline Mongolian Handwriting Recognition Based on Data Augmentation and Improved ECA-Net

A Multiplexed Network for End-to-End, Multilingual OCR

Long Short-Term Memory Neural Networks for Chinese Word Segmentation.