Abstract:Underwater acoustic signal recognition (UASR) systems face challenges in achieving high accuracy when processing complex data with low signal-to-noise ratio (SNR) in underwater environments, leading to limited noise robustness. Conventional approaches typically employ pre-trained denoising models for preprocessing noisy signals. However, due to disparate optimization goals between denoising and recognition models, denoising methods might introduce signal distortion, hampering effective enhancement of system accuracy. To address this issue, this article proposes a novel joint training framework with cross-attention fusion for robust UASR, called CAF-JT. CAF-JT consists of a denoising module, a recognition module, and the CAF module. It addresses the mismatch problem arising from different optimization directions by jointly training the denoising frontend and the recognition backend. Additionally, inspired by the multicondition training (MCT) method, the CAF module is designed to fuse characteristics from both denoised and noisy audio, thus incorporating noise information. This fusion mechanism enables the model to better adapt to the characteristics of the noisy environment and enhance its noise robustness. Furthermore, to improve the performance of UASR, time-frequency transformer (TF-transformer) blocks are incorporated into both the denoising module and the recognition module to capture the spatio-temporal distribution of spectral features. The proposed approach is evaluated on two open-source underwater acoustic signal datasets, namely ShipsEar and DeepShip. Extensive experimental demonstrate the superiority of CAF-JT over conventional joint training approaches, showcasing its improved noise robustness. Particularly in low SNR conditions, CAF-JT achieves the best average recognition rates of 94.84% and 93.61% on the two datasets, respectively.

A Novel Noise-Aware Deep Learning Model for Underwater Acoustic Denoising.

Self-Noise Suppression for AUV without Clean Data: a Noise2Noise Approach

Noise-Aware Subband Attention Network for Underwater Acoustic Signal Denoising

DBSA-Net: Dual Branch Self-Attention Network for Underwater Acoustic Signal Denoising.

Underwater Acoustic Signal Noise Reduction Based on a Fully Convolutional Encoder-Decoder Neural Network

Underwater Acoustic Signal Denoising Algorithms: A Survey of the State-of-the-art

Deep Learning for Noise Attenuation from the Ocean Bottom Node 4C Data

Underwater Target Classification at Greater Depths Using Deep Neural Network with Joint Multiple‐domain Feature

An attention-based multi-scale convolution network for intelligent underwater acoustic signal recognition

A Novel Underwater Acoustic Signal Denoising Algorithm for Gaussian/Non-Gaussian Impulsive Noise

Sparsity-Driven ALE Algorithm of Underwater Acoustic Tonals with Experiment Verification

A New Underwater Acoustic Signal Denoising Technique Based on CEEMDAN, Mutual Information, Permutation Entropy, and Wavelet Threshold Denoising.

A Denoising Representation Framework for Underwater Acoustic Signal Recognition.

SimNFND: A Forward-Looking Sonar Denoising Model Trained on Simulated Noise-Free and Noisy Data

Underwater Noise Target Recognition Based on Sparse Adversarial Co-Training Model with Vertical Line Array

A Novel Cross-Attention Fusion-Based Joint Training Framework for Robust Underwater Acoustic Signal Recognition

A new underwater acoustic signal denoising technique based on CEEMDAN, mutual information, permutation entropy, and wavelet threshold denoising

A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries

Deep Learning for Denoising: an Attempt to Recover the Effective Magnetic Resonance Sounding Signal in the Presence of High Level Noise.

An Attention‐guided Convolution Neural Network for Denoising of Distributed Acoustic Sensing–vertical Seismic Profile Data

Research on Underwater Image Denoising Based on Dual-Channels Residual Network