Abstract:Optical Character Recognition (OCR) stands as a pivotal technology in the digitization and processing of textual information from images. In this study, we propose a novel approach to OCR leveraging the VGG19 convolutional neural network (CNN) architecture. VGG19, renowned for its depth and performance in image classification tasks, is repurposed here to tackle the intricate challenges of character recognition. Through extensive experimentation and evaluation, this study demonstrate the efficacy of the proposed approach in achieving state-of-the-art accuracy and robustness in extracting textual information from images. This study uses diverse datasets comprising printed and handwritten text samples, augmenting them using various techniques to enhance model generalization. The VGG19 model is trained end-to-end, with its convolutional layers serving as feature extractors for character recognition. This paper presents a novel approach to Optical Character Recognition (OCR) using the VGG19 convolutional neural network (CNN). OCR is a fundamental technology that converts printed or handwritten text into digital format, facilitating document digitization and information retrieval. The proposed method leverages the hierarchical features learned by VGG19 to accurately extract textual information from images. This study has conducted experiments using publicly available datasets, achieving significant improvements in both training and test accuracy across epochs. Specifically, the proposed model has achieved a training accuracy of 94.34% and a test accuracy of 94.96% after ten epochs of training. Furthermore, we observed a consistent decrease in both training and test loss throughout the training process, indicating effective convergence and refinement of the model parameters. These results demonstrate the efficacy of the VGG19-based OCR model in accurately recognizing characters from diverse input images, highlighting its potential for various real-world applications such as document digitization, augmented reality, and accessibility tools.

A Deep Learning-Based Pre-Trained VGG19 Model for Optical Character Recognition

OCR using CRNN: A Deep Learning Approach for Text Recognition

TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models

Handwritten Text Recognition Using Convolutional Neural Network

Gated Recurrent Convolution Neural Network for Ocr

Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

Deep Learning-Based Multifunctional End-to-End Model for Optical Character Classification and Denoising

DeepNetDevanagari: a deep learning model for Devanagari ancient character recognition

Efficient, Lexicon-Free OCR using Deep Learning

Improved optical character recognition with deep neural network

Implementation of OCR using Convolutional Neural Network (CNN): A Survey

Performance analysis of hybrid deep learning framework using a vision transformer and convolutional neural network for handwritten digit recognition

An optimized deep learning model for optical character recognition applications

Enhancement of handwritten text recognition using AI-based hybrid approach

Devanagari Handwritten Character Recognition using fine-tuned Deep Convolutional Neural Network on trivial dataset

Manuscripts Character Recognition Using Machine Learning and Deep Learning

A comparison of deep transfer learning backbone architecture techniques for printed text detection of different font styles from unstructured documents

Convolutional-Neural-Network-Based Handwritten Character Recognition: An Approach with Massive Multisource Data

Facial Expression Recognition Method Based on Improved VGG Convolutional Neural Network

A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application

End-to-End Optical Character Recognition for Bengali Handwritten Words