An underwater target recognition algorithm incorporating improved attention mechanism and downsampling

DOI: https://doi.org/10.1007/s00371-024-03437-9
IF: 2.835
2024-05-14
The Visual Computer
Abstract:To address the issue of low accuracy in recognizing underwater targets due to dense and blurred targets in underwater target detection, we propose a joint improved attention mechanism and downsampling network for underwater target detection. Firstly, to address the issue of dense targets, we introduce an improved channel attention module. This module enhances attention to spatial dimension information, highlights the saliency of feature maps of different channels and improves the detection ability of dense targets. Secondly, to address the issue of blurred underwater targets, we introduce a down-sampling module that combines same-layer connections and cross-layer skipping. This module reduces information loss caused by convolutional down-sampling and integrates features from different layers more fully. By improving the feature expression of the underwater image, the network's detection accuracy for underwater blurred targets is further enhanced. Finally, the study introduces the focus loss function to address the imbalance of positive and negative samples. This function dynamically reduces the weight of easy-to-distinguish samples during training and prioritizes difficult-to-distinguish samples. Experimental results demonstrate a 2.71% increase in average accuracy of the improved algorithm on the DUO dataset. Additionally, the calculation amount is reduced by 9.1 GFLOPs, and the parameter amount is reduced by 5.44 M. Code: https://figshare.com/articles/dataset/improved-yolov5/25375129. Dataset: https://figshare.com/articles/dataset/DUO_zip/25370527.
computer science, software engineering
What problem does this paper attempt to address?