Abstract: Optical Character Recognition (OCR) is a central problem in computer vision research because it converts large volumes of unstructured text data into structured formats that support diverse artificial intelligence applications. The OCR process comprises two core components: text detection and text recognition. Text detection locates and extracts text regions using either object detection or segmentation techniques, while text recognition deciphers the content within those regions. In recent years, text recognition has advanced markedly, driven primarily by deep learning-based models. These models eliminate the need for manual feature processing, recognize text well even in complex scenes, and outperform traditional text recognition methods, and they have consequently become the dominant approach. This paper presents a comprehensive survey of both text detection and text recognition models. First, we systematically categorize and summarize existing off-the-shelf text detection methods. Next, we examine six distinct text recognition models in depth, taking into account their individual implementations. We then analyze the principal datasets currently used in text detection and recognition. We compare the performance of various text detection algorithms on the CTW1500, TotalText, and ICDAR2015 datasets, and we evaluate mainstream text recognition algorithms on the IIIT-5K, SVT, ICDAR2013, SVT-P, CUTE80, and ICDAR2015 datasets. Finally, we discuss future development and research trends in text detection and recognition, offering insights that can drive further progress in this area.

A Survey of Text Detection and Recognition Algorithms Based on Deep Learning Technology
