CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
Xi Zhao,Wei Feng,Zheng Zhang,Jingjing Lv,Xin Zhu,Zhangang Lin,Jinghe Hu,Jingping Shao,Zhao, Xi,Feng, Wei,Zhang, Zheng,Lv, Jingjing,Zhu, Xin,Lin, Zhangang,Hu, Jinghe,Shao, Jingping
DOI: https://doi.org/10.1007/s11263-024-02022-w
IF: 13.369
2024-03-05
International Journal of Computer Vision
Abstract:Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion. However, the segmentation process only considers each pixel independently, and the expansion process is difficult to achieve a favorable accuracy-speed trade-off. In this paper, we propose a context-aware and boundary-guided network (CBN) to tackle these problems. In CBN, a basic text detector is first used to predict initial segmentation results. Then, we propose a context-aware module to enhance text kernel feature representations, which considers both global and local contexts. Finally, we introduce a boundary-guided module to expand enhanced text kernels adaptively with only the pixels on the contours, which not only obtains accurate text boundaries but also keeps high speed, especially on high-resolution output maps. In particular, with a lightweight backbone, the basic detector equipped with our proposed CBN achieves state-of-the-art results on several popular benchmarks, and our proposed CBN can be plugged into several segmentation-based methods. Code will be available on https://github.com/XiiZhao/cbn.pytorch.
computer science, artificial intelligence