Abstract:For specific wideband sound source localization tasks performed in underwater environments, high precision, low computational complexity, and high robustness are required to meet the demand for accurate location in real-time processes. Traditional sound source localization methods that rely on models and signal processing techniques perform poorly in complex but common scenarios where noise, reverberation, and array deviations exist. Sound source localization systems based on deep neural networks are proposed recently and show superiority over conventional localization methods. However, lighter neural network structures for common wideband signals localization have been underexplored, which are crucial for improving the operational efficiency and practicability in underwater localization systems. In this paper, we propose a light neural network structure to locate the typical underwater wideband sound sources in communication and localization systems. The network mainly consists of convolutional and residual blocks. We train the network with simulated single tones and finetune it with multiple sets of single tones received in the experiment. As for the training feature, we choose the phase component of the covariance matrix and reshape the upper triangle into one column to reduce the feature dimension. The test signals include chirps, single-carrier modulation quadrature phase shift keying(SC-QPSK), multi-carrier modulations such as multi-tone and orthogonal frequency division multiplexing(OFDM). Both the simulation and experiment results have verified the high accuracy and robustness of our proposed method. Compared with the state-of-the-arts, our method obtains superior performance, especially in scenarios with low signal-to-noise ratio (SNR).

Deep Learning for Binaural Sound Source Localization with Low Signal-to-noise Ratio

Binaural Target Sound Source Localization Based on Time-frequency Units Selection

Deep and CNN Fusion Method for Binaural Sound Source Localisation

Full-Sphere Binaural Sound Source Localization Using Multi-task Neural Network

Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments

Localization Based Stereo Speech Source Separation Using Probabilistic Time-Frequency Masking and Deep Neural Networks

Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization

Binaural sound source localization using a hybrid time and frequency domain model

Deep Neural Network Based Audio Source Separation

DNN and Clustering Based Binaural Sound Source Localization in Mismatched HRTF Condition

Deep Learning Based Binaural Speech Separation in Reverberant Environments

Binaural Sound Source Localization Based on Sub-band SNR Estimation

Localization Based Stereo Speech Separation Using Deep Networks.

Sound source localization method based time-domain signal feature using deep learning

An Underwater Wideband Sound Source Localization Method Based on Light Neural Network Structure

A new hierarchical binaural sound source localization method based on Interaural Matching Filter

A Binaural Sound Source Localization Model Based on Time-Delay Compensation and Interaural Coherence

DNN-based Sound Source Localization Method with Microphone Array

Multiple Sound Sources Localization Using Sub-Band Spatial Features and Attention Mechanism

Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks

Speech Enhancement Based on Binaural Sound Source Localization and Cosh Measure Wiener Filtering