Acoustic Image Target Detection Method Based on the Fusion of RepVGG-ECA and YOLOv5

Zhikang Chi,Hongjian Wang,Yanhui Wei,Lihui Deng,Lei Wang,Xinyu Wang,Jinmu Tian,Bo Zhong,Zhao Wang
DOI: https://doi.org/10.1109/oceans51537.2024.10682219
2024-01-01
Abstract:In response to the need for real-time detection of acoustic images in the target detection process and the problem of insufficient feature extraction of small targets, this paper proposes a YOLOv5 acoustic target detection and recognition method based on the fusion of RepVGG and Effective Channel Attention(ECA) modules. First, a dataset of real forward-looking sonar images named Sonar2023 was established, which includes 11,560 original sonar images of four categories of target objects. Secondly, the characteristics of the acoustic images in the dataset were studied. In order to achieve the requirements of real-time detection of underwater targets, some C3 networks in the backbone layer are replaced by the RepVGG networks in the YOLOv5s algorithm. Through the idea of structural re-parameterization, the multi-channel structure of the training network is transformed into the single-channel structure of the inference network, which greatly reduces the number of parameters of the model and can be flexibly deployed on the underwater detection platform. In addition, to address the problems of insufficient feature extraction and unclear information perception for a large number of small targets in sonar images, some ECA modules have been added to the algorithm. Finally, tested on a self-built acoustic image dataset called Sonar2023, the experimental results show that the improved YOLOv5s model achieves a good balance between target detection accuracy and speed, and can effectively recognize targets from acoustic images to meet the needs of real-time detection tasks.
What problem does this paper attempt to address?