From object detection to text detection and recognition: A brief evolution history of optical character recognition

Haifeng Wang, Changzai Pan, Xiao Guo, Chunlin Ji, Ke Deng
DOI: https://doi.org/10.1002/wics.1547
2021-01-25
WIREs Computational Statistics
Abstract: Text detection and recognition, also known as optical character recognition (OCR), is an active research area undergoing rapid development, with many exciting applications. Deep-learning-based methods represent the state of the art in this area. However, these methods are largely deterministic: they give a single fixed output for each input. For both statisticians and general users, methods that support uncertainty inference are of great appeal, which leaves rich research opportunities to combine statistical models and methods with the established deep-learning-based approaches. In this paper, we provide a comprehensive review of the evolution of OCR research, with discussions of the statistical insights behind these developments and of potential directions for enhancing the current methods with statistical approaches. We hope this article can serve as a useful guidebook for statisticians who are seeking a path toward cutting-edge research in this exciting area.

This article is categorized under: Statistical Learning and Exploratory Methods of the Data Sciences > Deep Learning; Data: Types and Structure > Image and Spatial Data