A Steered Response Power Approach with Bilinear Prediction-Based Trade-Off Prewhitening for Speaker Localization
Zhiheng Wang, Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu
DOI: https://doi.org/10.1109/icassp48485.2024.10448270
2024-01-01
Abstract: This paper studies the problem of acoustic source localization in room environments. It presents an improved steered response power (SRP) approach with low-complexity, trade-off prewhitening. The method consists of two steps. In the first, the linear predictor used to model the speech signals is formulated as a bilinear form, and a group of convex-constrained linear prediction sub-models with respect to dual sub-predictors is established to pre-filter the microphone signals. In the second, the pre-filtered (prewhitened) microphone signals are used in SRP for speaker localization. Simulation results demonstrate the properties of the presented method: it is robust to reverberation and noise, and it is computationally efficient thanks to the bilinear form.
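The two steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' algorithm: the abstract does not specify the convex constraints or the trade-off rule, so the sketch swaps in a plain alternating least-squares fit of a Kronecker-structured predictor `a = kron(a2, a1)` (one common way to write a bilinear form), and a two-microphone SRP search over integer delays. The function names, the sub-predictor lengths `L1` and `L2`, the AR(1) source model, and the noise level are all assumptions chosen for illustration.

```python
import numpy as np

def bilinear_lp_prewhiten(x, L1=4, L2=4, iters=5):
    """Prewhiten x with a length L1*L2 predictor constrained to a = kron(a2, a1).

    Assumption: plain alternating least squares, standing in for the paper's
    convex-constrained, trade-off sub-models (not given in the abstract).
    """
    L, N = L1 * L2, len(x)
    # Row m holds the past samples [x(n-1), ..., x(n-L)] for n = L + m.
    past = np.stack([x[L - k : N - k] for k in range(1, L + 1)], axis=1)
    target = x[L:]
    # Reshape each past vector into an L1 x L2 matrix X so that
    # a^T x = a1^T X a2 when a = kron(a2, a1).
    X = past.reshape(-1, L2, L1).transpose(0, 2, 1)
    a2 = np.zeros(L2)
    a2[0] = 1.0
    for _ in range(iters):
        U = X @ a2                                  # fix a2, solve for a1
        a1 = np.linalg.lstsq(U, target, rcond=None)[0]
        V = np.einsum("i,mij->mj", a1, X)           # fix a1, solve for a2
        a2 = np.linalg.lstsq(V, target, rcond=None)[0]
    a = np.kron(a2, a1)
    return target - past @ a, a                     # prediction error, predictor

def srp_delay(e_ref, e_other, max_lag):
    """Return the integer lag that maximizes the delay-and-sum output power."""
    lags = np.arange(-max_lag, max_lag + 1)
    core = slice(max_lag, len(e_ref) - max_lag)     # skip wrapped-around samples
    power = [np.sum((e_ref[core] + np.roll(e_other, -lag)[core]) ** 2)
             for lag in lags]
    return int(lags[int(np.argmax(power))])

# Synthetic test signal (assumption): an AR(1) process observed at two
# microphones, the second delayed by true_delay samples, plus sensor noise.
rng = np.random.default_rng(0)
N, true_delay = 4096, 5
w = rng.standard_normal(N + true_delay)
s = np.zeros_like(w)
for n in range(1, len(s)):
    s[n] = 0.9 * s[n - 1] + w[n]
x1 = s[true_delay:] + 0.05 * rng.standard_normal(N)
x2 = s[:N] + 0.05 * rng.standard_normal(N)

e1, a_mic1 = bilinear_lp_prewhiten(x1)
e2, a_mic2 = bilinear_lp_prewhiten(x2)
est_delay = srp_delay(e1, e2, max_lag=10)
print("estimated delay (samples):", est_delay)
```

The point of the Kronecker structure is that each least-squares solve involves only `L1` or `L2` unknowns instead of `L1 * L2`, which is where a bilinear formulation can save computation. On this synthetic data the estimate is expected to match the simulated 5-sample delay; a real SRP system would steer over candidate source positions with more microphones rather than over a single pairwise lag.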