Traffic Sign Detection and Recognition Using Multi-Scale Fusion and Prime Sample Attention

Jinghao Cao,Junju Zhang,Wei Huang
DOI: https://doi.org/10.1109/access.2020.3047414
IF: 3.9
2021-01-01
IEEE Access
Abstract:Traffic sign detection, though one of the key technologies in intelligent transportation, still has bottleneck in accuracy due to the small size and diversity of traffic signs. To solve this problem, we proposed a two-stage CNN object detection algorithm based on multi-scale feature fusion and prime sample attention. We improved the original Faster R-cnn model in terms of feature extraction and sampling strategy. For feature extraction, to elevate the ability of neural networks to detect small objects, we adopted HRNet as the feature extractor. There are four stages in HRNet - a series of high resolution subnets as the starting point with repeated adding parallel high to low resolution subnets to form other stages. In the whole process, the information in the parallel multi-resolution sub-network is repeatedly exchanged to perform repeated multi-scale fusion. For sampling strategy, we adopted a simple and effective sampling and learning strategy called Prime Sample Attention (PISA), consisting of Importance-based Sample Reweighting (ISR) and Classification Aware Regression Loss (CARL). PISA proposed the concepts of IoU Hierarchical Partial Sorting (IoU-HLR) and Hierarchical Partial Score Sorting (Score-HLR), which sort the importance of positive samples and negative samples in mini-batch respectively. With the proposed method, the training process is focusing on prime samples rather than evenly treat all ones. The algorithm complexity of our method is lower than that of other state-of-the-art. After experiments by TT100K dataset, our method can attain a comparable or even better detection accuracy and robustness.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the accuracy of traffic sign detection in intelligent transportation systems. Although traffic sign detection technology has been widely used, due to the small size and diversity of traffic signs, there are still bottlenecks in its detection accuracy. Specifically, the technical challenges mentioned in the paper include: 1. **Difficulties in small - object detection**: For computers, it is always difficult to detect relatively small objects in the entire image. 2. **Difficulties in detecting multi - instruction traffic signs with fixed shapes**: For example, although the traffic signs in the TT100K dataset have only three shapes (rectangle, triangle, and circle), they cover 200 different instruction types. 3. **Multiple factors affect detection accuracy**: For example, changes in the size of objects in the field of view, bad weather, damage to traffic signs themselves, etc., may all affect the accuracy of detection. To meet these challenges, the author proposes a two - stage CNN object detection algorithm based on multi - scale feature fusion and Prime Sample Attention (PISA). By improving the feature extraction and sampling strategies of the Faster R - CNN model, this method aims to improve the detection ability of small targets and optimize sample selection during the training process, thereby improving detection accuracy and robustness. Specific technical improvements include using HRNet as a feature extractor and introducing the PISA method to optimize the learning strategy of the model. These improvements make the detection accuracy of this method on the TT100K dataset reach a level comparable to or better than that of existing advanced methods.