A Multi-Scale Coordinate Embedded Dual-Path U-Net for Shallow-Sea Source Localization

Xuening Wang,Hao Zhou,Jiawen He,Bin Zhang,Ruichun Tang
DOI: https://doi.org/10.1145/3672758.3672770
2024-01-01
Abstract:The data-driven shallow-water sound source location (SSL) method effectively alleviates the dependence of traditional matching field processing (MFP) methods on prior information. The multi-task learning U-Net based on convolutional block attention module (MTL-UNET-CBAM) used for SSL task exists the problems of single feature, limited spatial receptive field of attention and loss of location information. In order to solve the above problems, a multi-scale U-Net algorithm with coordinate pyramid attention (CPA) module and dual decoding attention (DDA) module is proposed to estimate the range and depth of sound sources in shallow water environment, called CPA-DDA-UNET. The CPA method extracts the coordinate attention map of up-sampled and down-sampled features of U-Net and embeds them into the pyramid paths, which can process the multi-scale input features and effectively establish the long-term dependency between multi-scale channel concerns. The coordinate embedding method reduces the loss of location information caused by two-dimensional global pooling and enables the network to focus on large areas. The DDA module establishes the dependency between the low-level feature and the high-level feature, and the dual encoder structure can estimate the range and depth simultaneously. The simulation results in SWellEx-96 environment show that the CPA-DDA-UNET method has higher location robustness, and the method can improve the range estimation accuracy significantly, but the depth prediction accuracy is limited. The results of ablation experiments show the effectiveness of CPA and DDA modules.
What problem does this paper attempt to address?