Multi-oriented Scene Text Detection by Fixed-Width Multi-Ratio Rotation Anchors

Beiji Zou, Wenjun Yang, Shu Liu, Lingzi Jiang
DOI: https://doi.org/10.1016/j.compeleceng.2021.107428
IF: 4.152
2021-01-01
Computers & Electrical Engineering
Abstract: Scene text detection plays an important role in many real-world applications. In this paper, we propose a multi-oriented scene text detection framework that comprises three main modules. A deep residual network at the front of the framework learns text representations. A set of fixed-width, multi-ratio rotation anchors then slides over the convolutional feature maps to generate text proposals that carry orientation information. Finally, an in-network recurrent architecture is seamlessly connected to encode the sequential context of the proposals, which facilitates the construction of text lines. Extensive experiments on two ICDAR benchmarks demonstrate the effectiveness of our approach.
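To make the idea of fixed-width, multi-ratio rotation anchors concrete, the following is a minimal sketch of how such anchors could be enumerated over a feature map. It is not the authors' implementation: the function names (`rotation_anchors`, `anchor_corners`), the stride of 16, the width of 16 pixels, the height-to-width ratios, and the set of angles are all assumptions chosen for illustration, since the abstract does not state them.

```python
import math
from itertools import product


def rotation_anchors(feat_h, feat_w, stride=16, width=16.0,
                     ratios=(0.5, 1.0, 2.0, 4.0),
                     angles_deg=(-60, -30, 0, 30, 60, 90)):
    """Enumerate (cx, cy, w, h, theta) anchors for every feature-map cell.

    The width is fixed; only the height (width * ratio) and the rotation
    angle vary. All default values are illustrative assumptions.
    """
    anchors = []
    for row, col in product(range(feat_h), range(feat_w)):
        # Centre of the cell, mapped back to input-image coordinates.
        cx = (col + 0.5) * stride
        cy = (row + 0.5) * stride
        for ratio, angle in product(ratios, angles_deg):
            anchors.append((cx, cy, width, width * ratio, math.radians(angle)))
    return anchors


def anchor_corners(anchor):
    """Return the four corner points of a rotated anchor as (x, y) tuples."""
    cx, cy, w, h, theta = anchor
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    corners = []
    for dx, dy in ((-w / 2, -h / 2), (w / 2, -h / 2),
                   (w / 2, h / 2), (-w / 2, h / 2)):
        # Rotate the offset about the centre, then translate.
        corners.append((cx + dx * cos_t - dy * sin_t,
                        cy + dx * sin_t + dy * cos_t))
    return corners


if __name__ == "__main__":
    anchors = rotation_anchors(feat_h=2, feat_w=3)
    print(len(anchors), "anchors; first:", anchors[0])
    print(anchor_corners(anchors[0]))
```

Under these assumptions every cell produces one anchor per (ratio, angle) pair, so the anchor count is feature-map height times width times the number of ratios times the number of angles. Keeping the width fixed means each proposal covers a narrow slice of a text line, which is what lets a later sequential stage link neighbouring proposals into full lines.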