OCR with a Convolutional Neural Networks Integration Model in Machine Vision

Rui Zhang,Xiaojun Wu,Lingteng Qiu,Zhicheng Yang
DOI: https://doi.org/10.1117/12.2503305
2018-01-01
Abstract:Optical character recognition (OCR) in complex scenes, particularly in industry environment, is a challenging problem that has received a significant amount of attention. A unified model for different types of character in different production lines is needed. In this paper, we propose a unified framework to classify characters using convolutional neural network (CNN) to satisfy the two main requirements in industrial OCR, the high recognition rate and less training time by combining the representational power of multi-layer neural networks together with multi-stage features. In the model, there are three CNNs, two with multi-stage features and one with deeper layers, which can be used to extract different fonts and types characters in different complex background. The results in experiments demonstrate the efficiency with high recognition rate and less training time in complex industrial environment.
What problem does this paper attempt to address?