Pedestrian Detection Method Based on Improved YOLOv5s for Densely Occluded Scenarios

Rongting Pan,Guofeng Qin,Yongjian Zhu,Peiwen Mi,Ming Li
DOI: https://doi.org/10.1109/BDICN62775.2024.00026
2024-01-01
Abstract:In response to the issues of low accuracy and high false negatives in real-time detection of densely occluded pedestrians, a pedestrian detection method based on improved YOLOv5s is proposed. This method involves replacing the Centralized-Comprehensive Convolution Block (C3) module in the backbone network with the DenseBlock module to enhance the model's feature extraction capabilities for small targets. The Efficient Intersection over Union (EIoU) loss function was used instead of the Complete Intersection over Union (CIoU) loss function to improve the model's ability to accurately locate densely occluded targets. Based on the feature pyramid structure, the Adaptive Spatial Feature Fusion (ASFF) module was introduced, enhancing the network's feature extraction capability. The introduction of the multi-head self-attention (MHSA) mechanism has enhanced the model's adaptability. The experimental results indicate that the improved YOLOv5s algorithm achieves mAP0.5 and mAP0.5:0.95 of 78.6% and 48.7%, respectively, on the CrowdHuman dataset. This represents an improvement of 2.0% and 2.2% compared to the initial YOLOv5s. The detection time is 11.4ms, showing good robustness for detecting targets of different sizes and meeting real-time requirements.
What problem does this paper attempt to address?