Manifold Learning For The Shape-Based Recognition Of Historical Arabic Documents

Mohamed Cheriet,Reza Farrahi Moghaddam,Ehsan Arabnejad,Guoqiang Zhong
DOI: https://doi.org/10.1016/B978-0-444-53859-8.00019-9
2013-01-01
Abstract:In this work, a recognition approach applicable at the letter block (subword) level for Arabic manuscripts is introduced. The approach starts with the binary images of the letter block to build their input representation, which makes it highly objective and independent of the designer. Then, using two different manifold learning techniques, the representations are reduced and learned. In order to decrease the computational complexity, PCA is applied to the input representations before manifold learning is applied. Also, in order to increase the performance and quality of the input representations, a gray stroke map (GSM) is considered in addition to the binary images. The performance of the approach is tested against a database from a historical Arabic manuscript with promising results.
What problem does this paper attempt to address?