Traffic Sign Detection and Recognition Using Multi-Scale Fusion and Prime Sample Attention

Jinghao Cao, Junju Zhang, Wei Huang

DOI: https://doi.org/10.1109/access.2020.3047414

IF: 3.9

2021-01-01

IEEE Access

Abstract: Traffic sign detection is one of the key technologies in intelligent transportation, but its accuracy is still limited by the small size and diversity of traffic signs. To address this, we propose a two-stage CNN object detection algorithm based on multi-scale feature fusion and prime sample attention. We improve the original Faster R-CNN model in both feature extraction and sampling strategy. For feature extraction, we adopt HRNet as the feature extractor to improve the network's ability to detect small objects. HRNet has four stages: the first is a high-resolution subnetwork, and each later stage adds a parallel lower-resolution subnetwork alongside the existing ones. Throughout the network, information is exchanged repeatedly among the parallel multi-resolution subnetworks, which performs repeated multi-scale fusion. For the sampling strategy, we adopt Prime Sample Attention (PISA), a simple and effective sampling and learning strategy consisting of Importance-based Sample Reweighting (ISR) and Classification-Aware Regression Loss (CARL). PISA introduces IoU Hierarchical Local Rank (IoU-HLR) and Score Hierarchical Local Rank (Score-HLR), which rank the importance of positive and negative samples in a mini-batch, respectively. With this method, training focuses on prime samples rather than treating all samples equally. The algorithmic complexity of our method is lower than that of other state-of-the-art methods. In experiments on the TT100K dataset, our method attains comparable or even better detection accuracy and robustness.
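The ranking and reweighting idea behind PISA can be illustrated with a minimal sketch. It is not the authors' implementation: it follows the general description of IoU-HLR (group positives by ground-truth box, rank within each group by IoU, then order by local rank before IoU) and applies a simple rank-to-weight mapping in the spirit of ISR. The function names `iou_hlr` and `isr_weights`, the sample tuples, and the `beta` and `gamma` values are all illustrative assumptions.

```python
from collections import defaultdict

def iou_hlr(samples):
    """Rank positive samples, most important first.

    samples: list of (sample_id, gt_id, iou) tuples (hypothetical format).
    """
    # Step 1: group positives by the ground-truth box they are matched to.
    groups = defaultdict(list)
    for sample_id, gt_id, iou in samples:
        groups[gt_id].append((iou, sample_id))

    # Step 2: the local rank is the position inside a group after
    # sorting that group by IoU in descending order.
    buckets = defaultdict(list)
    for members in groups.values():
        members.sort(key=lambda m: m[0], reverse=True)
        for local_rank, member in enumerate(members):
            buckets[local_rank].append(member)

    # Step 3: visit buckets from local rank 0 upward; inside each bucket
    # sort by IoU again, then concatenate into one global ranking.
    ranking = []
    for local_rank in sorted(buckets):
        bucket = sorted(buckets[local_rank], key=lambda m: m[0], reverse=True)
        ranking.extend(sample_id for _, sample_id in bucket)
    return ranking

def isr_weights(ranking, beta=0.0, gamma=2.0):
    """Map ranks to loss weights; beta and gamma are assumed values."""
    n = len(ranking)
    weights = {}
    for rank, sample_id in enumerate(ranking):
        u = (n - rank) / n  # linear importance, 1.0 for the top sample
        weights[sample_id] = ((1 - beta) * u + beta) ** gamma
    return weights

# Hypothetical positives: three matched to object 0, two to object 1.
samples = [("a", 0, 0.9), ("b", 0, 0.7), ("c", 1, 0.6),
           ("d", 1, 0.55), ("e", 0, 0.5)]
ranking = iou_hlr(samples)  # ['a', 'c', 'b', 'd', 'e']
weights = isr_weights(ranking)
```

In this toy input, sample `c` (IoU 0.6) outranks sample `b` (IoU 0.7) because `c` is the best match for its own object. This is the sense in which a "prime" sample is defined relative to its ground-truth box rather than by absolute IoU alone.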

Computer Science, Information Systems; Telecommunications; Engineering, Electrical & Electronic

What problem does this paper attempt to address?

This paper addresses the accuracy of traffic sign detection in intelligent transportation systems. Traffic sign detection is widely used, but the small size and diversity of traffic signs still limit its accuracy. The paper identifies three specific technical challenges:

1. **Small-object detection**: Objects that occupy only a small part of the image are consistently hard for a detector to find.
2. **Many instruction types within a few fixed shapes**: For example, the traffic signs in the TT100K dataset have only three shapes (rectangle, triangle, and circle) but cover 200 different instruction types.
3. **Multiple factors that degrade accuracy**: Changes in object size within the field of view, bad weather, and damage to the signs themselves can all reduce detection accuracy.

To meet these challenges, the authors propose a two-stage CNN object detection algorithm based on multi-scale feature fusion and Prime Sample Attention (PISA). The method modifies the feature extraction and sampling strategy of the Faster R-CNN model: HRNet serves as the feature extractor to improve the detection of small targets, and PISA guides sample selection and weighting during training. With these changes, the method reaches detection accuracy and robustness on the TT100K dataset that are comparable to or better than those of existing advanced methods.
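The repeated information exchange between parallel resolutions can be sketched for two branches. This is only an illustration of the exchange pattern under simplifying assumptions: HRNet uses learned convolutions for both directions, whereas this sketch substitutes fixed nearest-neighbour upsampling and 2x2 average pooling, and works on single-channel feature maps stored as nested lists. All function names are hypothetical.

```python
def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a 2D list."""
    out = []
    for row in x:
        wide = [v for v in row for _ in range(2)]  # repeat each column
        out.append(wide)
        out.append(list(wide))                     # repeat each row
    return out

def downsample2x(x):
    """2x2 average pooling (assumes even height and width)."""
    return [
        [(x[i][j] + x[i][j + 1] + x[i + 1][j] + x[i + 1][j + 1]) / 4.0
         for j in range(0, len(x[0]), 2)]
        for i in range(0, len(x), 2)
    ]

def add(a, b):
    """Element-wise sum of two equally sized 2D lists."""
    return [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def exchange(high, low):
    """One fusion step: each branch receives the other, resampled."""
    new_high = add(high, upsample2x(low))
    new_low = add(low, downsample2x(high))
    return new_high, new_low

# Toy feature maps: a 4x4 high-resolution and a 2x2 low-resolution branch.
high = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
low = [[10, 20], [30, 40]]
new_high, new_low = exchange(high, low)
```

Both branches keep their own resolution after the step, so the high-resolution branch retains the spatial detail that small signs depend on while still receiving context from the coarser branch.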

An Efficient Framework for Detection and Recognition of Numerical Traffic Signs

A Three-Stage Real-Time Detector for Traffic Signs in Large Panoramas

Traffic sign detection based on multi-scale feature extraction and cascade feature fusion

Road Traffic Sign Detection Method Based on RTS R-CNN Instance Segmentation Network

Toward Effective Traffic Sign Detection via Two-Stage Fusion Neural Networks

Neural Network Based on Multi-Scale Saliency Fusion for Traffic Signs Detection

A Traffic-Sign Detection Algorithm Based on Improved Sparse R-cnn

A feature‐enhanced hybrid attention network for traffic sign recognition in real scenes

Learning multi-layer interactive residual feature fusion network for real-time traffic sign detection with stage routing attention

Efficient Traffic-Sign Recognition with Scale-aware CNN

Traffic Signs Detection and Segmentation Based on the Improved Mask R-CNN

A lightweight vehicle mounted multi-scale traffic sign detector using attention fusion pyramid

A small‐target traffic sign detection algorithm based on partial conv and atrous spatial pyramid

Small Object Detection in Traffic Scenes Based on Attention Feature Fusion

Multi-scale feature fusion with attention mechanism for crowded road object detection

Focus First: Coarse-to-Fine Traffic Sign Detection with Stepwise Learning

Traffic Sign Recognition With Lightweight Two-Stage Model in Complex Scenes

MIAF-Net: A Multi-Information Attention Fusion Network for Field Traffic Sign Detection

Traffic-Sign Detection and Classification in the Wild

Improved YOLOv5 network for real-time multi-scale traffic sign detection