Improved YOLOv4 Based on Dilated Coordinate Attention for Object Detection

Zhenzhen Yang, Yixin Zheng, Jing Shao, Yongpeng Yang
DOI: https://doi.org/10.1007/s11042-023-17817-1
IF: 2.577
2023-01-01
Multimedia Tools and Applications
Abstract: The classical YOLOv4 object detector surpasses several well-known object detectors in speed and accuracy. Despite this strong performance, it still has limitations, such as insufficient feature extraction. We therefore propose an improved YOLOv4 method for object detection. Specifically, we introduce a dilated coordinate attention module, which combines dilated convolution with coordinate attention, to improve detection accuracy. We also adopt a multi-scale training strategy, which trains on data at different scales to further improve detection performance. Experiments on the PASCAL VOC2007 and VOC2012 datasets demonstrate that the proposed improved YOLOv4 detector outperforms other state-of-the-art object detectors. In particular, it improves on the original YOLOv4 by 1.84% and 1.91% in mean average precision (mAP) on the two datasets, respectively.
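The multi-scale training strategy mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the scale range or the sampling rule, so the values below (square inputs from 320 to 608 pixels in steps of 32) and the helper names `sample_input_size` and `scale_box` are assumptions made for illustration only, not the authors' settings.

```python
import random


def sample_input_size(rng, min_size=320, max_size=608, stride=32):
    """Pick a square input resolution that is a multiple of `stride`.

    The range and stride are assumed values, not taken from the paper.
    """
    candidates = list(range(min_size, max_size + 1, stride))
    return rng.choice(candidates)


def scale_box(box, old_size, new_size):
    """Rescale an (x_min, y_min, x_max, y_max) box when a square image
    is resized from old_size to new_size pixels per side."""
    ratio = new_size / old_size
    return tuple(coord * ratio for coord in box)


# Draw a new resolution for each of five hypothetical training batches.
rng = random.Random(0)
schedule = [sample_input_size(rng) for _ in range(5)]

# Ground-truth boxes must be rescaled together with the image.
resized_box = scale_box((10, 20, 30, 40), old_size=416, new_size=832)
```

In a real training loop the sampled size would be used to resize each batch before it is passed to the detector, so the network sees the same objects at several resolutions.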