A Detection Transformer-Based Intelligent Identification Method for Multiple Types of Road Traffic Safety Facilities

Lingxin Lu,Hui Wang,Yan Wan,Feifei Xu
DOI: https://doi.org/10.3390/s24103252
IF: 3.9
2024-05-21
Sensors
Abstract:Road traffic safety facilities (TSFs) are of significant importance in the management and maintenance of traffic safety. The complexity and variety of TSFs make it challenging to detect them manually, which renders the work unsustainable. To achieve the objective of automatic TSF detection, a target detection dataset, designated TSF-CQU (TSF data collected by Chongqing University), was constructed based on images collected by a car recorder. This dataset comprises six types of TSFs and 8410 instance samples. A detection transformer with an improved denoising anchor box (DINO) was selected to construct a model that would be suitable for this scenario. For comparison purposes, Faster R-CNN (Region Convolutional Neural Network) and Yolov7 (You Only Look Once version 7) were employed. The DINO model demonstrated the highest performance on the TSF-CQU dataset, with a mean average precision (mAP) of 82.2%. All of the average precision (AP) values exceeded 0.8, except for streetlights (AP = 0.77) and rods (AP = 0.648). The DINO model exhibits minimal instances of erroneous recognition, which substantiates the efficacy of the contrastive denoising training approach. The DINO model rarely makes misjudgments, but a few missed detection.
engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation
What problem does this paper attempt to address?
The problem this paper attempts to address is the automatic detection and recognition of road traffic safety facilities (TSF). Specifically, in the current traffic safety management, due to the numerous and complex types of traffic safety facilities, manually detecting these facilities is both time-consuming and prone to errors, leading to unsustainable management work. Therefore, the paper proposes an intelligent recognition method based on Detection Transformer, aiming to achieve automatic detection of various types of road traffic safety facilities. To achieve this goal, the researchers constructed a dataset named TSF-CQU, which contains 6 types of traffic safety facilities and 8410 instance samples. The researchers chose an improved denoising anchor box detection transformer (DINO) to build the model and conducted comparative experiments with Faster R-CNN and YOLOv7. The experimental results show that the DINO model performed best on the TSF-CQU dataset, with a mean Average Precision (mAP) of 82.2%, and the average precision (AP) for most categories exceeded 0.8, proving the effectiveness of this method.