Abstract:Texts on the intelligent transportation scene include mass information. Fully harnessing this information is one of the critical drivers for advancing intelligent transportation. Unlike the general scene, detecting text in transportation has extra demand, such as a fast inference speed, except for high accuracy. Most existing real-time text detection methods are based on the shrink mask, which loses some geometry semantic information and needs complex post-processing. In addition, the previous method usually focuses on correct output, which ignores feature correction and lacks guidance during the intermediate process. To this end, we propose an efficient multi-scene text detector that contains an effective text representation similar mask (SM) and a feature correction module (FCM). Unlike previous methods, the former aims to preserve the geometric information of the instances as much as possible. Its post-progressing saves 50$\%$ of the time, accurately and efficiently reconstructing text contours. The latter encourages false positive features to move away from the positive feature center, optimizing the predictions from the feature level. Some ablation studies demonstrate the efficiency of the SM and the effectiveness of the FCM. Moreover, the deficiency of existing traffic datasets (such as the low-quality annotation or closed source data unavailability) motivated us to collect and annotate a traffic text dataset, which introduces motion blur. In addition, to validate the scene robustness of the SM-Net, we conduct experiments on traffic, industrial, and natural scene datasets. Extensive experiments verify it achieves (SOTA) performance on several benchmarks. The code and dataset are available at: \url{<a class="link-external link-https" href="https://github.com/fengmulin/SMNet" rel="external noopener nofollow">this https URL</a>}.

HFENet: Hybrid Feature Enhancement Network for Detecting Texts in Scenes and Traffic Panels

An Efficient Framework for Detection and Recognition of Numerical Traffic Signs

Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes

EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection

Boundary-aware Arbitrary-shaped Scene Text Detector with Learnable Embedding Network

MFECN: Multi-level Feature Enhanced Cumulative Network for Scene Text Detection.

Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection

Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism

HAFE: A Hierarchical Awareness and Feature Enhancement Network for Scene Text Recognition

Adaptive Segmentation Network for Scene Text Detection

(HTBNet)Arbitrary Shape Scene Text Detection with Binarization of Hyperbolic Tangent and Cross-Entropy

Attention-based Feature Decomposition-Reconstruction Network for Scene Text Detection

Scene Text Detection Using HRNet and Spatial Attention Mechanism

FETNet: Feature Erasing and Transferring Network for Scene Text Removal

A Multi-Level Feature Fusion Network for Scene Text Detection with Text Attention Mechanism

MOST: A Multi-Oriented Scene Text Detector with Localization Refinement

A Multi-Scale Natural Scene Text Detection Method Based on Attention Feature Extraction and Cascade Feature Fusion

Detecting Text in the Wild with Deep Character Embedding Network

MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild

TFG-Net: A Text Feature-Guided Network for Small Traffic Sign Detection