Road Traffic Sign Detection Method Based on RTS R-CNN Instance Segmentation Network

Guirong Zhang, Yiming Peng, Hai Wang
DOI: https://doi.org/10.3390/s23146543
IF: 3.9
2023-07-21
Sensors
Abstract: With the rapid development of the autonomous driving industry, there is increasing research on related perception tasks. However, research on road surface traffic sign detection tasks is still limited. There are two main challenges to this task. First, when the target object's pixel ratio is small, the detection accuracy often decreases. Second, the existing publicly available road surface traffic sign datasets have limited image data. To address these issues, this paper proposes a new instance segmentation network, RTS R-CNN, for road surface traffic sign detection tasks based on Mask R-CNN. The network can accurately perceive road surface traffic signs and provide important information for the autonomous driving decision-making system. First, CSPDarkNet53_ECA is proposed in the feature extraction stage to enhance the performance of deep convolutional networks by increasing inter-channel interactions. Second, to improve the network's detection accuracy for small target objects, GR-PAFPN is proposed in the feature fusion part, which uses a residual feature augmentation module (RFA) and atrous spatial pyramid pooling (ASPP) to optimize PAFPN and introduces a balanced feature pyramid module (BFP) to handle the imbalanced feature information at different resolutions. Finally, data augmentation is used to generate more data and prevent overfitting in specific scenarios. The proposed method has been tested on the open-source dataset Ceymo, achieving a Macro F1-score of 87.56%, which is 2.3% higher than the baseline method, while the inference speed reaches 23.5 FPS.
engineering, electrical & electronic; chemistry, analytical; instruments & instrumentation
What problem does this paper attempt to address?
The paper addresses two main challenges in accurately detecting road surface traffic signs for self-driving vehicles:

1. **Low detection accuracy for small targets**: When the target object occupies only a small proportion of the image's pixels, detection accuracy often decreases. For smaller traffic signs, existing detection methods may therefore not be accurate enough.
2. **Limited public datasets**: Existing public road surface traffic sign datasets contain limited image data, which restricts the amount of data available for model training and so affects the robustness and generalization ability of the model.

To address these problems, the paper proposes RTS R-CNN, a new instance segmentation network based on Mask R-CNN. The specific improvements are:

- **Feature extraction stage**: CSPDarkNet53_ECA is proposed to enhance the performance of the deep convolutional network by increasing the interaction between channels.
- **Feature fusion part**: GR-PAFPN is proposed. The Residual Feature Augmentation module (RFA) and Atrous Spatial Pyramid Pooling (ASPP) are used to optimize PAFPN, and the Balanced Feature Pyramid module (BFP) is introduced to handle unbalanced feature information at different resolutions.
- **Data augmentation**: More data is generated through techniques such as flipping and adjusting the color space, to prevent overfitting in specific scenarios.

These improvements aim to raise the detection accuracy for small target objects and, by increasing the amount of data, to improve the robustness and generalization ability of the model.
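
The data augmentation step can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names `hflip` and `scale_brightness`, the inclusive `(x_min, y_min, x_max, y_max)` box convention, and the use of HSV brightness scaling as the "color space" adjustment are all assumptions made for illustration. The point it shows is that, for instance segmentation, a geometric transform such as flipping has to be applied to the image, the instance mask, and the bounding box together, whereas a color transform touches only the image.

```python
import colorsys


def hflip(image, mask, box):
    """Flip an image and its instance mask left-right and mirror the box.

    image: list of rows, each row a list of (r, g, b) tuples in 0..255
    mask:  list of rows of 0/1 values with the same height and width
    box:   (x_min, y_min, x_max, y_max), inclusive pixel indices (assumed convention)
    """
    width = len(image[0])
    flipped_image = [row[::-1] for row in image]
    flipped_mask = [row[::-1] for row in mask]
    x_min, y_min, x_max, y_max = box
    # A pixel at column x moves to column width - 1 - x, so min and max swap roles.
    flipped_box = (width - 1 - x_max, y_min, width - 1 - x_min, y_max)
    return flipped_image, flipped_mask, flipped_box


def scale_brightness(image, factor):
    """Scale the V channel in HSV space by `factor`, clipping at full brightness.

    Only the image changes; masks and boxes are unaffected by color transforms.
    """
    result = []
    for row in image:
        new_row = []
        for r, g, b in row:
            h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
            v = min(1.0, v * factor)
            nr, ng, nb = colorsys.hsv_to_rgb(h, s, v)
            new_row.append((round(nr * 255), round(ng * 255), round(nb * 255)))
        result.append(new_row)
    return result


if __name__ == "__main__":
    # Tiny 2x3 toy sample: one instance covering the left part of the image.
    image = [[(255, 0, 0), (0, 255, 0), (0, 0, 255)],
             [(10, 10, 10), (20, 20, 20), (30, 30, 30)]]
    mask = [[1, 0, 0],
            [1, 1, 0]]
    box = (0, 0, 1, 1)

    aug_image, aug_mask, aug_box = hflip(image, mask, box)
    aug_image = scale_brightness(aug_image, 0.8)
    print(aug_mask, aug_box)
```

One caveat when flipping road markings: a horizontal flip turns a left-pointing marking into a right-pointing one, so any direction-dependent class labels would need to be swapped along with the pixels. How the paper handles this is not stated in the summary above.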