Character Flow Detection and Rectification for Scene Text Spotting.

Beiji Zou,Wenjun Yang,Kai-Wen Li,Enquan Huang,Shu Liu
DOI: https://doi.org/10.1007/978-3-030-89029-2_23
2021-01-01
Abstract:Text can be widely found in natural scenes. However, it is considerably difficult to detect and recognize the scene text due to its variations and distortions. In this paper, we propose a three-stage bottom-up scene text spotter, including text segmentation, text rectification and text recognition. The text segmentation part adopts a feature pyramid network (FPN) to extract character instances by combining local and global information, then a joint network of FPN and bidirectional long short-term memory is developed to explore the affinity among the isolated characters, which are grouped into character flows. The text rectification part utilizes a spatial transformer network to deal with the complex deformation of the character flows, thus enhancing their readability. Finally, the rectified text is recognized through an attention-based sequence recognition network. Extensive experiments are conducted on several benchmarks, showing that our approach achieves the state-of-the-art performance.
What problem does this paper attempt to address?