Handwritten Text Recognition Using Convolutional Neural Network

Atman Mishra,A. Sharath Ram,Kavyashree C
DOI: https://doi.org/10.48550/arXiv.2307.05396
2023-07-11
Abstract:OCR (Optical Character Recognition) is a technology that offers comprehensive alphanumeric recognition of handwritten and printed characters at electronic speed by merely scanning the document. Recently, the understanding of visual data has been termed Intelligent Character Recognition (ICR). Intelligent Character Recognition (ICR) is the OCR module that can convert scans of handwritten or printed characters into ASCII text. ASCII data is the standard format for data encoding in electronic communication. ASCII assigns standard numeric values to letters, numeral, symbols, white-spaces and other characters. In more technical terms, OCR is the process of using an electronic device to transform 2-Dimensional textual information into machine-encoded text. Anything that contains text both machine written or handwritten can be scanned either through a scanner or just simply a picture of the text is enough for the recognition system to distinguish the text. The goal of this papers is to show the results of a Convolutional Neural Network model which has been trained on National Institute of Science and Technology (NIST) dataset containing over a 100,000 images. The network learns from the features extracted from the images and use it to generate the probability of each class to which the picture belongs to. We have achieved an accuracy of 90.54% with a loss of 2.53%.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is Handwritten Text Recognition (HTR). Specifically, it is to use the Convolutional Neural Network (CNN) model to recognize handwritten English numbers and letters. Handwritten text recognition technology is used to convert human - readable handwritten languages into machine - readable codes. This problem is challenging because different individuals do not write the same character exactly the same, and changes in personal writing styles and writing environments also have an impact on handwritten characters. Therefore, developing a system that can efficiently and accurately recognize handwritten text has important practical application value, especially in scientific and commercial applications that need to process a large amount of handwritten data. The paper mentions that the authors used a data set provided by the National Institute of Standards and Technology (NIST), which contains more than 100,000 images, to train their CNN model. Through this model, they achieved an accuracy rate of 90.54% and a loss rate of 2.53%. This shows that, despite some challenges, the CNN - based handwritten text recognition method can achieve a relatively high recognition accuracy, making it possible for practical applications.