Abstract:Optical Character Recognition (OCR) stands as a pivotal technology in the digitization and processing of textual information from images. In this study, we propose a novel approach to OCR leveraging the VGG19 convolutional neural network (CNN) architecture. VGG19, renowned for its depth and performance in image classification tasks, is repurposed here to tackle the intricate challenges of character recognition. Through extensive experimentation and evaluation, this study demonstrate the efficacy of the proposed approach in achieving state-of-the-art accuracy and robustness in extracting textual information from images. This study uses diverse datasets comprising printed and handwritten text samples, augmenting them using various techniques to enhance model generalization. The VGG19 model is trained end-to-end, with its convolutional layers serving as feature extractors for character recognition. This paper presents a novel approach to Optical Character Recognition (OCR) using the VGG19 convolutional neural network (CNN). OCR is a fundamental technology that converts printed or handwritten text into digital format, facilitating document digitization and information retrieval. The proposed method leverages the hierarchical features learned by VGG19 to accurately extract textual information from images. This study has conducted experiments using publicly available datasets, achieving significant improvements in both training and test accuracy across epochs. Specifically, the proposed model has achieved a training accuracy of 94.34% and a test accuracy of 94.96% after ten epochs of training. Furthermore, we observed a consistent decrease in both training and test loss throughout the training process, indicating effective convergence and refinement of the model parameters. These results demonstrate the efficacy of the VGG19-based OCR model in accurately recognizing characters from diverse input images, highlighting its potential for various real-world applications such as document digitization, augmented reality, and accessibility tools.

Deep Learning-Based Multifunctional End-to-End Model for Optical Character Classification and Denoising

Multinoise-type Blind Denoising Using a Single Uniform Deep Convolutional Neural Network.

Frequency-Relevant Residual Learning for Multi-Modal Image Denoising.

Image Denoising Via Multi-Scale Gated Fusion Network

Ensemble Model of Attention Mechanism-Based DCGAN and Autoencoder for Noised OCR Classification

Deep Convolutional Architecture for Natural Image Denoising

Efficient Deep Image Denoising Via Class Specific Convolution

OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

A Deep Learning Technology based OCR Framework for Recognition Handwritten Expression and Text

Dilated Residual Encode-Decode Networks for Image Denoising

A Deep Learning-Based Pre-Trained VGG19 Model for Optical Character Recognition

Improving Text Image Resolution Using a Deep Generative Adversarial Network for Optical Character Recognition

Efficient, Lexicon-Free OCR using Deep Learning

DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Improved optical character recognition with deep neural network

A Multiscale Image Denoising Algorithm Based On Dilated Residual Convolution Network

Deep learning optical image denoising research based on principal component estimation

Intelligent Micron Optical Character Recognition of DFB Chip Using Deep Convolutional Neural Network

Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters

Noise Reduction in Optical Coherence Tomography Images Using a Deep Neural Network with Perceptually-Sensitive Loss Function.

An optimized deep learning model for optical character recognition applications