Sonar Waveform Synthesis Based on Mel Generative Adversarial Networks with Self-Attention Mechanisms

Yidi Zhu,Zhiquan Bai,Xin'ao Li,Haixiao Wang,Wanzeng Kong
DOI: https://doi.org/10.1109/coa58979.2024.10723654
2024-01-01
Abstract:Considering the complicated process of traditional sonar waveform synthesis method, in this study, we apply models from the speech domain to the underwater acoustic domain based on deep learning algorithms using a transfer learning approach. The core of this study lies in the development of a transfer learning framework based on the generative adversarial network (GAN) model, which effectively adapts to the special characteristics of sonar waveforms by fine-tuning the pretrained MelGAN model and the self-attention mechanism. Because sonar waveform cannot be accurately assessed by auditory assessments, we adopted Fréchet Audio Distance (FAD) as an objective audio assessment metric. Experiments based on the DeepShip dataset show that the proposed model effectively synthesizes a sonar waveform.
What problem does this paper attempt to address?