Abstract:Bangla has a unique script with a complex set of characters, making it a fascinating subject of study for linguists and cultural enthusiasts. Unique in some of its similar characters which are only distinguishable by subtle differences in their shapes and diacritics, there has been a notable increase in research on Bangla character recognition and classification using machine learning-based approaches. However, Handwritten Bangla Character Recognition (HBCR) training requires an adequate amount of data from a diversely distributed dataset. Making diverse datasets for HBCR training is a challenging and tedious task to carry out. Yet, there is limited research on the automatic generation of handwritten Bangla characters. Motivated by this open area of research, this paper proposes a novel approach 'Okkhor-Diffusion' for class-guided generation of Bangla isolated handwritten characters using a novel Denoising Diffusion Probabilistic Model (DDPM). No prior research has used DDPM for this purpose, making the proposed approach novel. The DDPM is a generative model that uses a diffusion process to transform noise-corrupted data into diverse samples; despite being trained on a small training set. In our experiments, StyleGAN2-ADA had notably inferior performance compared to Okkhor-Diffusion in generating realistic isolated handwritten Bangla characters. Experimental results on the BanglaLekha-Isolated dataset demonstrate that the proposed Okkhor-Diffusion model generates realistic isolated handwritten Bangla characters, with a mean Multi-Scale Structural Similarity Index Measure (MS-SSIM) score of 0.178 compared to 0.177 for the real samples. The Fréchet Inception Distance (FID) score for the synthetic handwritten Bangla characters is 5.426. Finally, the newly proposed Bangla Character Aware Fréchet Inception Distance (BCAFID) score of the proposed Okkhor-Diffusion model is 10.388. The code for the proposed Okkhor-Diffusion framework is available at https://github.com/MubtasimFuad10/Okkhor-Diffusion.

Confronting the Constraints for Optical Character Segmentation from Printed Bangla Text Image

Segmentation of Offline Handwritten Bengali Script

End-to-End Optical Character Recognition for Bengali Handwritten Words

Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types

Design of an Optical Character Recognition System for Camera-based Handheld Devices

A Novel Approach to Printed Arabic Optical Character Recognition

A BLSTM Network for Printed Bengali OCR System with High Accuracy

Kurdish Text Segmentation using Projection-Based Approaches

Extraction of Line Word Character Segments Directly from Run Length Compressed Printed Text Documents

BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text Images

Development of a Multi-User Recognition Engine for Handwritten Bangla Basic Characters and Digits

Bangla handwritten character recognition using MobileNet V1 architecture

Optical Text Recognition in Nepali and Bengali: A Transformer-based Approach

Thinning Chinese, Korean, Japanese and Thai script for segmentation-free OCRs

A Hough Transform based Technique for Text Segmentation

Okkhor-Diffusion: Class Guided Generation of Bangla Isolated Handwritten Characters Using Denoising Diffusion Probabilistic Model (DDPM)

A Complete Workflow for Development of Bangla OCR

Segmentation-Free Bangla Offline Handwriting Recognition using Sequential Detection of Characters and Diacritics with a Faster R-CNN

Classification of Bangla Compound Characters Using a HOG-CNN Hybrid Model

Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents