Abstract: Optical Character Recognition (OCR) is a central problem in computer vision research because it converts large amounts of unstructured text data into structured formats that support diverse artificial intelligence applications. The OCR process has two core components: text detection and text recognition. Text detection locates and extracts text regions, using either object detection or segmentation techniques, while text recognition deciphers the content within those regions. In recent years, text recognition has advanced rapidly, driven primarily by deep learning-based models. These models eliminate the need for manual feature engineering and recognize text well even in complex scenes; they outperform traditional text recognition methods and have become the dominant approach. This paper presents a comprehensive survey of both text detection and text recognition models. First, we systematically categorize and review existing off-the-shelf text detection methods. Next, we examine six distinct text recognition models in depth, taking into account their individual implementations. We also analyze the principal datasets currently used in text detection and recognition. We then compare the performance of various text detection algorithms on the CTW1500, TotalText, and ICDAR2015 datasets, and evaluate mainstream text recognition algorithms on the IIIT-5K, SVT, ICDAR2013, SVT-P, CUTE80, and ICDAR2015 datasets. Finally, we discuss future development and research trends in text detection and recognition, offering insights that can drive further progress in this area.
