HRYNet: A Highly Robust YOLO Network for Complex Road Traffic Object Detection

Lindong Tang, Lijun Yun, Zaiqing Chen, Feiyan Cheng
DOI: https://doi.org/10.3390/s24020642
IF: 3.9
2024-01-20
Sensors
Abstract: Object detection is a crucial component of the perception system in autonomous driving. However, road scenes are highly intricate environments in which the visibility and features of traffic targets are easily attenuated or lost because of lighting conditions, weather conditions, time of day, background elements, and traffic density. Current object detection networks lack sufficient learning capability for such targets, which further exacerbates the loss of features during feature extraction and fusion and significantly compromises detection performance on traffic targets. This paper presents a novel method, HRYNet, to address these problems. Firstly, a dual fusion gradual pyramid structure (DFGPN) is introduced, which employs a two-stage gradient fusion strategy to generate more comprehensive multi-scale high-level semantic information, strengthen the interconnection between non-adjacent feature layers, and reduce the information gap between them. Secondly, HRYNet introduces an anti-interference feature extraction module, the residual multi-head self-attention mechanism (RMA). RMA enhances target information by applying a feature channel weighting strategy, thereby reducing background interference and improving the attention capability of the network. Finally, the detection performance of HRYNet was evaluated on three datasets: the horizontally collected dataset BDD100K, the UAV high-altitude dataset VisDrone, and a custom dataset. Experimental results demonstrate that HRYNet achieves a higher mAP_0.5 than YOLOv8s on the three datasets, with increases of 10.8%, 16.7%, and 5.5%, respectively. To optimize HRYNet for mobile devices, this study also presents Lightweight HRYNet (LHRYNet), which reduces the number of model parameters by 2 million. LHRYNet still outperforms YOLOv8s in terms of mAP_0.5, with improvements of 6.7%, 10.9%, and 2.5% on the three datasets, respectively.
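The mAP_0.5 figures quoted in the abstract rest on an intersection-over-union (IoU) threshold of 0.5: a predicted box counts as correct only if it overlaps an unmatched ground-truth box by at least that much. The following is a minimal sketch of that idea for a single class, not the evaluation code used in the paper. The function names, the `(x1, y1, x2, y2)` box format, and the all-point interpolation of the precision-recall curve are assumptions made for illustration; a full mAP additionally matches detections per image and averages AP over all classes.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def average_precision(predictions, ground_truths, iou_threshold=0.5):
    """AP for one class (hypothetical helper).

    predictions: list of (score, box); ground_truths: list of boxes.
    """
    if not ground_truths:
        return 0.0
    matched = set()
    hits = []
    # Visit predictions from most to least confident.
    for _score, box in sorted(predictions, key=lambda p: p[0], reverse=True):
        best_iou, best_idx = 0.0, None
        for idx, gt in enumerate(ground_truths):
            if idx in matched:
                continue  # each ground-truth box may be matched only once
            overlap = iou(box, gt)
            if overlap > best_iou:
                best_iou, best_idx = overlap, idx
        if best_idx is not None and best_iou >= iou_threshold:
            matched.add(best_idx)
            hits.append(1)  # true positive
        else:
            hits.append(0)  # false positive
    # Precision and recall after each ranked prediction.
    true_positives = 0
    precisions, recalls = [], []
    for rank, hit in enumerate(hits, start=1):
        true_positives += hit
        precisions.append(true_positives / rank)
        recalls.append(true_positives / len(ground_truths))
    # Make precision non-increasing from right to left (all-point interpolation).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    # Area under the interpolated precision-recall curve.
    ap, previous_recall = 0.0, 0.0
    for precision, recall in zip(precisions, recalls):
        ap += (recall - previous_recall) * precision
        previous_recall = recall
    return ap
```

Raising `iou_threshold` makes the metric stricter, which is why the threshold is always reported alongside the score.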
Engineering, Electrical & Electronic; Chemistry, Analytical; Instruments & Instrumentation
What problem does this paper attempt to address?
This paper attempts to solve the problem of traffic object detection in complex road scenarios. Specifically, the authors point out three main challenges that current object detection networks face in such scenarios:
1. **Complex traffic scenarios**: Factors such as weather, lighting conditions, time of day, background elements, and traffic density can weaken or erase the visibility and feature information of traffic objects.
2. **Large variation in object scales**: Object sizes in traffic scenes differ significantly, so the algorithm must adapt to objects of different scales.
3. **Loss of feature information**: Existing object detection networks rely on convolution and pooling operations, which tend to lose feature information during feature extraction and fusion.

To address these challenges, the paper proposes a new method named HRYNet (Highly Robust YOLO Network). HRYNet improves detection performance in complex traffic scenarios through two key techniques:
1. **Dual Fusion Gradual Pyramid Structure (DFGPN)**: A two-stage gradient fusion strategy enhances the generation of multi-scale high-level semantic information, strengthens the connection between non-adjacent feature layers, and reduces the information gap between them.
2. **Anti-interference Feature Extraction Module (RMA, Residual Multi-Head Self-Attention Mechanism)**: A feature channel weighting strategy enhances target information, reduces background interference, and improves the attention ability of the network.

In addition, to optimize HRYNet for mobile devices, the paper proposes a lightweight version, LHRYNet, which reduces the number of model parameters while maintaining high detection performance. The experimental results show that HRYNet and LHRYNet both achieve better performance than YOLOv8s on multiple datasets, especially for traffic objects with weak feature information.
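To make the term "residual multi-head self-attention" more concrete, the sketch below shows the generic pattern the name refers to: the feature vectors are split into heads, each head runs scaled dot-product self-attention, the head outputs are concatenated, and the result is added back to the input through a skip connection. This is not the RMA module from the paper. The function names are hypothetical, the query, key, and value projections are assumed to be identity mappings (real modules learn them), and the channel weighting step described above is omitted.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections) to keep the sketch short.
    """
    dim = len(tokens[0])
    scale = math.sqrt(dim)
    output = []
    for query in tokens:
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in tokens]
        weights = softmax(scores)
        # Each output token is a weighted average of all value vectors.
        output.append(
            [sum(w * value[d] for w, value in zip(weights, tokens)) for d in range(dim)]
        )
    return output


def multi_head_self_attention(tokens, num_heads):
    """Split channels into heads, attend per head, concatenate the results."""
    dim = len(tokens[0])
    if dim % num_heads != 0:
        raise ValueError("feature dimension must be divisible by num_heads")
    head_dim = dim // num_heads
    heads = []
    for h in range(num_heads):
        start, stop = h * head_dim, (h + 1) * head_dim
        heads.append(self_attention([token[start:stop] for token in tokens]))
    return [sum((head[i] for head in heads), []) for i in range(len(tokens))]


def residual_attention_block(tokens, num_heads=2):
    """Skip connection: output = input + attention(input)."""
    attended = multi_head_self_attention(tokens, num_heads)
    return [[x + a for x, a in zip(token, att)] for token, att in zip(tokens, attended)]
```

The skip connection means the block can only add to the input features rather than replace them, which is the usual motivation for residual designs: the original signal survives even where the attention weights are uninformative.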