Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling

Rui Tang, Yimin Chen, Jian Gao, Shaowen Hao, Hunhui He
DOI: https://doi.org/10.3390/electronics13193874
IF: 2.9
2024-10-01
Electronics
Abstract: Side-scan sonar (SSS) images present unique challenges to computer vision due to their lower resolution, smaller targets, and fewer features. Although mainstream backbone networks have shown promising results on traditional vision tasks, they rely on conventional convolution to reduce the dimensionality of feature maps, which may cause information loss for small targets and degrade performance on SSS images. To address this problem, we propose a new underwater target detection model, built on the YOLOv8 network, that is based on upsampling and downsampling. First, we introduce a new general downsampling module called shallow robust feature downsampling (SRFD) and a receptive field convolution (RFCAConv) in the backbone network, so that multiple feature maps extracted by different downsampling techniques can be fused into a more robust feature map with a complementary set of features. Additionally, an ultra-lightweight and efficient dynamic upsampling module (Dysample) is introduced to improve the accuracy of the feature pyramid network (FPN) in fusing different levels of features. On the underwater shipwreck dataset, the mAP50 of our improved model increased by 4.4% compared to the baseline model.
engineering, electrical & electronic; computer science, information systems; physics, applied
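
The abstract's central idea, that feature maps produced by different downsampling techniques carry complementary information about small targets, can be illustrated with a toy example. The following is a minimal sketch in plain Python and is not the SRFD module itself: the function names, the 4x4 input, and the choice of three techniques (strided sampling, max pooling, average pooling) are assumptions made purely for illustration, and the "fusion" here is simple channel stacking rather than anything learned.

```python
def strided(fm):
    """Keep the top-left value of each 2x2 block (a stride-2 slice)."""
    return [row[::2] for row in fm[::2]]


def pool(fm, reduce_fn):
    """Apply reduce_fn to every non-overlapping 2x2 block.

    Assumes the map has even height and width.
    """
    h, w = len(fm), len(fm[0])
    return [
        [reduce_fn([fm[i][j], fm[i][j + 1], fm[i + 1][j], fm[i + 1][j + 1]])
         for j in range(0, w - 1, 2)]
        for i in range(0, h - 1, 2)
    ]


def mean(values):
    return sum(values) / len(values)


def fuse_downsample(fm):
    """Stack three 2x-downsampled views as channels: [strided, max, average]."""
    return [strided(fm), pool(fm, max), pool(fm, mean)]


# Hypothetical 4x4 map with one bright pixel standing in for a small target.
feature_map = [
    [0, 0, 0, 0],
    [0, 9, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]

fused = fuse_downsample(feature_map)
for name, channel in zip(["strided", "max", "average"], fused):
    print(name, channel)
```

In this toy case the strided view misses the bright pixel entirely, max pooling preserves its peak value, and average pooling keeps a diluted trace of it. Stacking the three views keeps information that any single technique would discard, which is the intuition behind fusing multiple downsampling paths.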