High-speed Scene Text Detection with Attention and Multi-scale Label Generation

Yanzhao Wang,Xiaodong Gu
DOI: https://doi.org/10.1007/s11063-022-10975-7
IF: 2.565
2022-01-01
Neural Processing Letters
Abstract:Scene text detection are useful in abundant areas of work and daily life. Due to the limitation of regression-based methods in detecting irregular shape text (such as curve text), segmentation- based methods, being able to detect text in various shapes, have aroused intense interest of researchers and become the mainstream of scene text detection. However, in segmentation- based methods, complex post-processing decreases the detection speed. In this paper, we propose a high-speed scene text detection method which adopts attention mechanism and multi-scale label generation. It performs well on both detection speed and detection accuracy. Due to the adoption of pyramid attention network, position attention module, multi-scale label generation method, and trainable binarization, our method achieves high detection accuracy. Meanwhile, without complex post-processing, our method achieves high detection speed. On Total-text dataset, it outperforms the state-of-the-art methods with 1.5% improvement of F-measure.
What problem does this paper attempt to address?