Abstract:For specific wideband sound source localization tasks performed in underwater environments, high precision, low computational complexity, and high robustness are required to meet the demand for accurate location in real-time processes. Traditional sound source localization methods that rely on models and signal processing techniques perform poorly in complex but common scenarios where noise, reverberation, and array deviations exist. Sound source localization systems based on deep neural networks are proposed recently and show superiority over conventional localization methods. However, lighter neural network structures for common wideband signals localization have been underexplored, which are crucial for improving the operational efficiency and practicability in underwater localization systems. In this paper, we propose a light neural network structure to locate the typical underwater wideband sound sources in communication and localization systems. The network mainly consists of convolutional and residual blocks. We train the network with simulated single tones and finetune it with multiple sets of single tones received in the experiment. As for the training feature, we choose the phase component of the covariance matrix and reshape the upper triangle into one column to reduce the feature dimension. The test signals include chirps, single-carrier modulation quadrature phase shift keying(SC-QPSK), multi-carrier modulations such as multi-tone and orthogonal frequency division multiplexing(OFDM). Both the simulation and experiment results have verified the high accuracy and robustness of our proposed method. Compared with the state-of-the-arts, our method obtains superior performance, especially in scenarios with low signal-to-noise ratio (SNR).

Sparse DNN Model for Frequency Expanding of Higher Order Ambisonics Encoding Process

Anti Spatial Aliasing HOA Encoding Method Based on Aliasing Projection Matrix

Sound Source Localization in Spherical Harmonics Domain Based on High-Order Ambisonics Signals Enhancement Neural Network

A robust super-resolution approach with sparsity constraint for near-field wideband acoustic imaging

Acoustic Field Visualization and Source Localization Via Physics-Informed Learning of Sparse Data with Adaptive Sampling

A Robust Super-Resolution Approach with Sparsity Constraint in Acoustic Imaging

Direct source and early reflections localization using deep deconvolution network under reverbrate environment

Direct source and early reflections localization using deep deconvolution network under reverberant environment

Frequency Domain Singular Value Decomposition for Efficient Spatial Audio Coding

Neural Ambisonic Encoding For Multi-Speaker Scenarios Using A Circular Microphone Array

An Underwater Wideband Sound Source Localization Method Based on Light Neural Network Structure

Improving Spatial Resolution of First-order Ambisonics Using Sparse MDCT Representation

Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement

Achieving the sparse acoustical holography via the sparse bayesian learning

SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation

Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features

A new algorithm on hierarchical sparse signal reconstruction

A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions.

Ambisonizer: Neural Upmixing as Spherical Harmonics Generation

Perceptually-motivated Spatial Audio Codec for Higher-Order Ambisonics Compression

Sparse reconstruction of sound field using pattern-coupled Bayesian compressive sensing