Remote Sensing Vehicle Detection Based on SWTPH-YOLOv5

Chenen Xie,HongBing Ma
DOI: https://doi.org/10.1109/mvipit60427.2023.00039
2023-01-01
Abstract:In transportation complexes, remote sensing technology used to detect vehicles faces challenges such as inaccurate positioning, false alarms, and any detection. To address these issues, this article introduces a remote sensing method based on swtfh yolov5 for detecting vehicles. Firstly, a new feature has been added in the shallow layer to maximize information superiority. Secondly, in order to improve the performance of detecting vehicles of different sizes in various remote sensing images, the advantage of pyramid networks lies in their display.Moreover, the C3 module is enhanced by incorporating Swin Transformer blocks to better preserve contextual semantic information of small objects. Furthermore, the boundary box loss function is optimized by using SIoU instead of CIoU loss, leading to faster convergence speed and higher convergence accuracy of the model. Lastly, CBAM attention modules are incorporated into the Neck to enhance feature extraction capabilities and improve overall model performance. Experimental comparisons conducted on three public datasets, namely UCAS-AOD-CAR, VEDAI, and DIOR-Vehicle, demonstrate the advantages of the SWTPH-YOLOv5 model over state-of-the-art object detection methods. The results show that the SWTPH-YOLOv5 model achieves a 4.6% improvement in accuracy, 7.3% improvement in recall rate, and 5.5% improvement in mAP50 compared to the YOLOv5 model.
What problem does this paper attempt to address?