Generation of a synthetic handwritten Bangla compound character dataset using a modified conditional GAN architecture

Anubhab Das, Arka Choudhuri, Arpan Basu, Ram Sarkar
DOI: https://doi.org/10.1007/s11042-022-13891-z
IF: 2.577
2023-03-25
Multimedia Tools and Applications
Abstract: Developing an Optical Character Recognition (OCR) system for handwritten text is a challenging research problem. The same piece of text can look very different from one writer to the next, since writing style varies from person to person. In addition, for many regional languages, the lack of datasets with a large number of varied images hinders research progress. This is especially true for Bangla, which is the most widely spoken language of Bangladesh and the second most widely spoken language of India. Only a few works have addressed the generation of handwritten Bangla basic characters, and almost none the generation of handwritten Bangla compound characters. To alleviate this data scarcity, this work proposes a method for generating synthetic handwritten Bangla compound characters. A generative adversarial network (GAN) based model is developed for this purpose, taking inspiration from the recent Auxiliary Classifier GAN (AC-GAN) model. A novel dataset partitioning scheme for handwritten-character tasks is also developed to improve the performance of the model. The quality of the generated samples is evaluated using the Fréchet Inception Distance (FID) metric. The proposed model is observed to perform better than the basic AC-GAN architecture and some other existing GAN architectures. The sample dataset generated as part of this work is publicly available at https://github.com/hachiro-2001/Bengali_Compound_Characters.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering