Abstract:While sound source localization (SSL) using a spherical microphone array system can be applied to obtain visual beam patterns of source distribution maps in a range of omnidirectional acoustic applications, the present challenges of the spherical measurement system on the valid frequency ranges and the spatial distortion as well as the grid-related limitations of data-driven SSL approaches raise the need to develop an appropriate method. Imbued by these challenges, this study proposes a deep learning (DL) approach to achieve the high-resolution performance of localizing multiple sound sources tailored for omnidirectional acoustic applications. First, we present a spherical target map representation that can panoramically pinpoint the position and strength information of multiple sound sources without any grid-related constraints. Then, a dual-branched spherical convolutional autoencoder is proposed to obtain high-resolution localization results from the conventional spherical beamforming maps while incorporating frequency-variant and distortion-invariant strategies to address the inherent challenges. We quantitatively and qualitatively assess our proposed method's localization capability for multiple sound sources and validate that the proposed method can achieve far more precise and computationally efficient results than the existing approaches. By extension, we newly present the experimental setup that can create omnidirectional acoustic scenarios for the multiple SSL. By evaluating our proposed method in this experimental setup, we demonstrate the effectiveness and applicability of the proposed method with the experimental data. Our study delivers the proposed approach's potential of being utilized in various SSL applications.

Deep Learning Based Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays.

Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays

Real-Time Space 3D Acoustic Location Based on Monte Carlo Algorithm

Three-dimensional Acoustic Localization Algorithm Based on Coordinate Conversion

ACP1–ADA1 interaction in type 2 diabetes: a study in coronary artery disease

DNN-based Sound Source Localization Method with Microphone Array

Deep Learning-Enabled High-Resolution and Fast Sound Source Localization in Spherical Microphone Array System

Deep learning-enhanced single point sound source localization for spherical microphone array

Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation

Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition

Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates

Speech Activity Detection and Speaker Localization Based on Distributed Microphones.

A Deep Learning Method for DOA Estimation with Covariance Matrices in Reverberant Environments

Neural Ambisonic Encoding For Multi-Speaker Scenarios Using A Circular Microphone Array

Visually Supervised Speaker Detection and Localization via Microphone Array

Sound Localization Based on Acoustic Source Using Multiple Microphone Array in an Indoor Environment

Study on the Localization Method of Multi-Aperture Acoustic Array Based on TDOA

Organ transplantation in Poland.

Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture

Microphone Clustering and BP Network based Acoustic Source Localization in Distributed Microphone Arrays

A Two Microphone-Based Approach For Source Localization Of Multiple Speech Sources