Pyramid attention object detection network with multi-scale feature fusion

Xiu Chen, Yujie Li, Yoshihisa Nakatoh
DOI: https://doi.org/10.1016/j.compeleceng.2022.108436
2022-10-27
Abstract: With the development of deep learning, object detection has made substantial progress. However, when the object to be detected is small or partially occluded, detection networks often fail to detect it. We propose a multi-scale feature fusion pyramid attention module, which combines global average pooling results at multiple scales with the upper features in the residual blocks of the feature extraction network to capture more spatial context information from the original feature map. We add the proposed module to YOLOv3 and conduct experiments on the PASCAL VOC and MS COCO datasets. The experimental results show that the attention module helps the network detect small objects and accurately detect partially occluded objects.
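The abstract does not give the module's exact layout, so the following is only a minimal sketch of the general idea of pooling a feature map at several scales and using the fused result as an attention gate. It works on a single-channel map stored as a list of rows. The function names, the scales (1, 2, 4), the averaging used for fusion, the sigmoid gate, and the residual re-weighting are all assumptions made for illustration, not the authors' design.

```python
import math


def adaptive_avg_pool(fmap, bins):
    """Average-pool a 2D map (list of rows) into a bins x bins grid."""
    h, w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(bins):
        # Row range covered by output cell i (floor start, ceil end).
        r0, r1 = (i * h) // bins, ((i + 1) * h + bins - 1) // bins
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, ((j + 1) * w + bins - 1) // bins
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        pooled.append(row)
    return pooled


def upsample_nearest(pooled, h, w):
    """Nearest-neighbour upsampling of a bins x bins grid back to h x w."""
    bins = len(pooled)
    return [[pooled[(r * bins) // h][(c * bins) // w] for c in range(w)]
            for r in range(h)]


def pyramid_attention(fmap, scales=(1, 2, 4)):
    """Hypothetical pyramid attention: fuse multi-scale pooled context,
    turn it into a sigmoid gate, and re-weight the input residually."""
    h, w = len(fmap), len(fmap[0])
    context = [[0.0] * w for _ in range(h)]
    for bins in scales:
        up = upsample_nearest(adaptive_avg_pool(fmap, bins), h, w)
        for r in range(h):
            for c in range(w):
                # Assumed fusion rule: plain average over the scales.
                context[r][c] += up[r][c] / len(scales)
    # Assumed combination: out = x + x * sigmoid(context).
    return [[fmap[r][c] * (1.0 + 1.0 / (1.0 + math.exp(-context[r][c])))
             for c in range(w)]
            for r in range(h)]
```

With `scales=(1, 2, 4)`, the coarsest level (one bin) is ordinary global average pooling, while the finer levels keep some spatial layout. That is one plausible reading of how multi-scale pooling could add spatial context to the original feature map. A real detector would apply this per channel on tensors, typically with learned convolutions between the pooling and the gate.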
engineering, electrical & electronic, computer science, interdisciplinary applications, hardware & architecture