An Accurate Threshold Insensitive Kernel Detector for Arbitrary Shaped Text.

Xijun Qian,Yifan Liu,Yubin Yang
DOI: https://doi.org/10.1109/icpr48806.2021.9412325
2021-01-01
Abstract:Recently, segmentation-based methods are popular in scene text detection due to the segmentation results that can easily represent scene text of arbitrary shapes. However, previous works view text instances as normal objects. It is obvious that the edge of text differs from normal objects. In this paper, we propose a threshold insensitive kernel detector for arbitrary shaped text called TIKD, which includes a simple but stable base model and a new learning weight called Decay Loss Weight (DLW). By suppressing outlier pixels in a gradual way, DLW can lead the network to learn more accurate text instances. Our method shows great power in accuracy and stability. It is worth mentioning that we achieve the precision, recall, f-measure of 88.7%, 83.7%, 86.1% respectively on the Total-Text dataset, with a fast speed of 16.3 frames per second. What's more, even if we set the threshold in an extreme situation range from 0.1 to 0.9, our method can always achieve a stable f-measure over 79.9% on the Total-Text dataset.
What problem does this paper attempt to address?