Knowledge-Based Scene Text Recognition for Industrial Applications

Guowei Deng,Jingzheng Tu,Cailian Chen,Jianping He,Xinyi Le
DOI: https://doi.org/10.1109/icit48603.2022.10002724
2022-01-01
Abstract:Scene text recognition (STR) methods combined with semantic information have made great progress to recognize texts in natural scenes, most of which are daily words. However, research on mining semantic information in industrial texts attracts less attention. Since industrial texts follow a different semantic pattern defined by industry standards, it challenges many existing methods to conduct accurate semantic reasoning. In this paper, we abstract the industry standards into two aspects of prior knowledge: the grouping property and the prior lexicon. Correspondingly, a knowledge-based language model is proposed with several group-wise correlation modules and a lexicon-based reasoning module to learn semantic rules from both data and the prior knowledge. Besides, we transfer the prior knowledge into data by generating synthetic pure text datasets according to the industry standards’ rules, which introduces more prior knowledge to the language model. Furthermore, a novel STR framework is presented by combining the knowledge-based language model and an attention-based vision model. For evaluation, two industrial text datasets called CIN and SB are collected from real-world industrial field surveillance. Experiments indicate that our method’s word-level accuracy outperforms state-of-the-art methods with 15% and 10.61% on CIN and SB datasets respectively.
What problem does this paper attempt to address?