Abstract: Bangla has a unique script with a complex set of characters, making it a fascinating subject of study for linguists and cultural enthusiasts. Some of its characters are similar and distinguishable only by subtle differences in their shapes and diacritics, and research on Bangla character recognition and classification using machine learning-based approaches has increased notably. However, training a Handwritten Bangla Character Recognition (HBCR) model requires an adequate amount of data from a diversely distributed dataset, and building such datasets is a challenging and tedious task. Yet there is limited research on the automatic generation of handwritten Bangla characters. Motivated by this open area of research, this paper proposes a novel approach, 'Okkhor-Diffusion', for class-guided generation of isolated handwritten Bangla characters using a Denoising Diffusion Probabilistic Model (DDPM). No prior research has used DDPM for this purpose. The DDPM is a generative model that uses a diffusion process to transform noise-corrupted data into diverse samples, even when trained on a small training set. In our experiments, StyleGAN2-ADA performed notably worse than Okkhor-Diffusion at generating realistic isolated handwritten Bangla characters. Experimental results on the BanglaLekha-Isolated dataset demonstrate that the proposed Okkhor-Diffusion model generates realistic isolated handwritten Bangla characters, with a mean Multi-Scale Structural Similarity Index Measure (MS-SSIM) score of 0.178 for the generated samples compared to 0.177 for the real samples. The Fréchet Inception Distance (FID) score for the synthetic handwritten Bangla characters is 5.426. Finally, the newly proposed Bangla Character Aware Fréchet Inception Distance (BCAFID) score of the Okkhor-Diffusion model is 10.388.
The code for the proposed Okkhor-Diffusion framework is available at https://github.com/MubtasimFuad10/Okkhor-Diffusion.
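The diffusion process mentioned in the abstract can be illustrated with a minimal sketch of the standard DDPM forward (noising) step. This is not the authors' implementation: the linear schedule, its endpoints (1e-4 to 0.02), the 1000-step horizon, and the helper names `linear_beta_schedule`, `cumulative_alphas`, and `q_sample` are all assumptions chosen for illustration, and the "image" is a toy four-value list rather than a Bangla character.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed values, not from the paper)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def q_sample(x0, t, alpha_bars, rng):
    """Draw x_t ~ N(sqrt(alpha_bar_t) * x0, (1 - alpha_bar_t) * I).

    x0 is a flat list of pixel values scaled to [-1, 1]; t is a 0-based step index.
    Returns the noised image and the Gaussian noise that was mixed in, which is
    the quantity a DDPM denoiser is usually trained to predict.
    """
    signal = math.sqrt(alpha_bars[t])
    sigma = math.sqrt(1.0 - alpha_bars[t])
    noise = [rng.gauss(0.0, 1.0) for _ in x0]
    x_t = [signal * x + sigma * e for x, e in zip(x0, noise)]
    return x_t, noise


betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)
rng = random.Random(0)

x0 = [1.0, -1.0, 0.5, 0.0]  # toy 4-pixel "image", hypothetical data
x_early, noise_early = q_sample(x0, 0, alpha_bars, rng)    # almost unchanged
x_late, noise_late = q_sample(x0, 999, alpha_bars, rng)    # almost pure noise
```

Under this assumed schedule, the first step leaves the input nearly intact while the last step retains almost none of the original signal. Generation runs the process in reverse: a trained network, conditioned on the character class in a class-guided setup, removes the noise step by step.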

Okkhor-Diffusion: Class Guided Generation of Bangla Isolated Handwritten Characters Using Denoising Diffusion Probabilistic Model (DDPM)

Bangla Handwritten Digit Recognition and Generation

Handwritten Word Recognition using Deep Learning Approach: A Novel Way of Generating Handwritten Words

MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten Compound Characters

Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier

Generation of simulated data for Bengali text localization in natural images

Convolutional neural network-based ensemble methods to recognize Bangla handwritten character

GACnet-Text-to-Image Synthesis With Generative Models Using Attention Mechanisms With Contrastive Learning

End-to-End Optical Character Recognition for Bengali Handwritten Words

BanglaNet: Bangla Handwritten Character Recognition using Ensembling of Convolutional Neural Network

Bengali Handwritten Character Classification using Transfer Learning on Deep Convolutional Neural Network

Multichannel Attention Networks with Ensembled Transfer Learning to Recognize Bangla Handwritten Charecter

Feature Extraction Using Deep Generative Models for Bangla Text Classification on a New Comprehensive Dataset

Bengali Handwritten Grapheme Classification: Deep Learning Approach

Performance Evaluation of Deep Generative Models for Generating Hand-Written Character Images

Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types

Text2FaceGAN: Face Generation from Fine Grained Textual Descriptions

Bengali Handwritten Digit Recognition using CNN with Explainable AI

Handwritten Bangla character recognition using convolutional neural networks: a comparative study and new lightweight model