Abstract:We have developed a series of effective online data augmentation strategies for the traffic sign recognition dataset, which are able to improve the model performance without any extra computational overhead during the prediction process; To enhance the feature extraction ability of the backbone almost without extra model complexity, we have developed an efficient CSAM module placed at the beginning of the backbone, with the help of the hybrid channel and spatial attention mechanism and the residual bottleneck structure; To make better use of the features extracted by the backbone, we combined the channel attention module CAM with the feature pyramid network (FPN) and path aggregation network (PAN) structure for a multi‐scale attention feature fusion detection head. Currently, traffic sign recognition techniques have been brought into the assistive driving of automobiles. However, small traffic sign recognition in real scenes is still a challenging task due to the class imbalance issue and the size limit of the traffic signs. To address the above issues, a feature‐enhanced hybrid attention network is proposed based on YOLOv5s for a small, fast, and accurate traffic sign detector. First, a series of online data augmentation strategies are designed in the preprocessing module for the model training. Second, the hybrid channel and spatial attention module CSAM are integrated into the backbone for a better feature extraction ability. Third, the channel attention module CAM is used in the detection head for a more efficient feature fusion ability. To validate the approach, extensive experiments are conducted based on the Tsinghua‐Tencent 100K dataset. It is found that the novel method achieves state‐of‐the‐art performance with only negligible increases in the model parameter and computational overhead. Specifically, the mAP@0.5 , parameters, and FLOPs are 85.8%, 7.13 M, and 16.1 G, respectively.

Toward Effective Traffic Sign Detection via Two-Stage Fusion Neural Networks

An Efficient Framework for Detection and Recognition of Numerical Traffic Signs

Traffic Sign Detection using Feature Fusion and Contextual Information

Attention-based Neural Network for Traffic Sign Detection

Learning multi-layer interactive residual feature fusion network for real-time traffic sign detection with stage routing attention

Road Traffic Sign Detection Method Based on RTS R-CNN Instance Segmentation Network

Traffic Sign Detection and Recognition Using Multi-Scale Fusion and Prime Sample Attention

MIAF-Net: A Multi-Information Attention Fusion Network for Field Traffic Sign Detection

Attention Feature Fusion Network for Small Traffic Sign Detection

Traffic Sign Recognition With Lightweight Two-Stage Model in Complex Scenes

Small Traffic Sign Detection Through Selective Feature Fusion Based Faster R-CNN with Arc-Softmax Loss

YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion

Focus First: Coarse-to-Fine Traffic Sign Detection with Stepwise Learning

Traffic sign detection based on multi-scale feature extraction and cascade feature fusion

Traffic Sign Detection Based on Camera Imaging Apriority.

Multi-Feature Fusion and Enhancement Single Shot Detector for Traffic Sign Recognition

Context-Aware and Attention-Driven Weighted Fusion Traffic Sign Detection Network

A feature‐enhanced hybrid attention network for traffic sign recognition in real scenes

Improved Traffic Sign Detection Algorithm Based on Faster R-CNN

Transformer Fusion and Residual Learning Group Classifier Loss for Long-Tailed Traffic Sign Detection

CAFFNet: Channel Attention and Feature Fusion Network for Multi-target Traffic Sign Detection