AccNet: Occluded Scene Text Enhancing Network with Accretion Blocks

Yanxiang Gong,Zhiqiang Zhang,Guozhen Duan,Zheng Ma,Mei Xie
DOI: https://doi.org/10.1007/s00138-022-01351-5
IF: 2.983
2022-01-01
Machine Vision and Applications
Abstract:Scene text with occlusions is common in the real world, and occluded text recognition is important for many machine vision applications. However, corresponding techniques are not well explored as public datasets cannot represent the situation well, and methods designed for occluded text are still scarce. In this work, we discuss different kinds of occlusions and propose an occluded scene text enhancing network to improve recognition performance. The network is based on generative adversarial networks, and we design accretion blocks to help the network generate the occluded image regions. The model is independent of the recognition networks, so it can be readily used in different frameworks and can be easily trained without the annotations of text content. We also refine the training objective to improve the framework. Experiments on several public benchmarks demonstrate that the proposed method effectively enhances occluded text images, improving recognition accuracy by over 10% on several state-of-the-art frameworks. Meanwhile, the network has no severe impact on the text images without occlusions.
What problem does this paper attempt to address?