A Feasible Framework for Arbitrary-Shaped Scene Text Recognition

Jinjin Zhang, Wei Wang, Di Huang, Qingjie Liu, Yunhong Wang
DOI: https://doi.org/10.48550/arXiv.1912.04561
2019-12-12
Abstract: Deep learning based methods have achieved surprising progress in Scene Text Recognition (STR), one of the classic problems in computer vision. In this paper, we propose a feasible framework for multi-lingual arbitrary-shaped STR, consisting of instance segmentation based text detection and a language model based attention mechanism for text recognition. Our STR algorithm not only recognizes Latin and non-Latin characters, but also supports arbitrary-shaped text recognition. Our method won first place in the Scene Text Spotting Task (Latin Only, Latin and Chinese) of the ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text competition. Code is available at https://github.com/zhang0jhon/AttentionOCR.
Computer Vision and Pattern Recognition