Learning based Ge'ez character handwritten recognition

Hailemicael Lulseged Yimer,Hailegabriel Dereje Degefa,Marco Cristani,Federico Cunico
2024-11-20
Abstract:Ge'ez, an ancient Ethiopic script of cultural and historical significance, has been largely neglected in handwriting recognition research, hindering the digitization of valuable manuscripts. Our study addresses this gap by developing a state-of-the-art Ge'ez handwriting recognition system using Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks. Our approach uses a two-stage recognition process. First, a CNN is trained to recognize individual characters, which then acts as a feature extractor for an LSTM-based system for word recognition. Our dual-stage recognition approach achieves new top scores in Ge'ez handwriting recognition, outperforming eight state-of-the-art methods, which are SVTR, ASTER, and others as well as human performance, as measured in the HHD-Ethiopic dataset work. This research significantly advances the preservation and accessibility of Ge'ez cultural heritage, with implications for historical document digitization, educational tools, and cultural preservation. The code will be released upon acceptance.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to solve the problem of handwritten recognition of the ancient Ethiopian script Ge'ez. Specifically, Ge'ez is an ancient writing system of great cultural and historical significance, but it has been largely neglected in handwritten recognition research, which hinders the digitization process of precious manuscripts. This paper fills this gap by developing an advanced Ge'ez handwritten recognition system based on convolutional neural networks (CNNs) and long - short - term memory networks (LSTMs). The system adopts a two - stage recognition process: first, CNN is used to train and recognize individual characters, and then CNN is used as a feature extractor to provide input for the LSTM - based word recognition system. This two - stage recognition method has achieved a new highest score in Ge'ez handwritten recognition, surpassing eight state - of - the - art methods and human performance. These methods include SVTR, ASTER, etc., and are evaluated on the HHD - Ethiopic dataset. The main contributions of the paper are as follows: - A new method based on CNN and LSTM for the recognition of Ge'ez handwritten scripts is proposed; - The model has a character error rate (CER) of 26.95% and a normalized edit distance (NED) of 26.50% in Ge'ez optical character recognition (OCR), achieving the current best performance. Through these contributions, this research significantly promotes the preservation and access of Ge'ez cultural heritage, and is of great significance for the digitization of historical documents, the development of educational tools and cultural protection.