A DNN filtering-based sound source localization method

Qingsong Zhang, Hongqing Pan, Quanjun Song, Junchun Wang
DOI: https://doi.org/10.1109/ICSP58490.2023.10248493
2023-01-01
Abstract: This paper proposes a sound source localization method based on DNN filtering. The method consists of three parts: voice activity detection (VAD), spectral map-based DNN filtering, and sound source localization. First, VAD splits the received audio into speech and non-speech segments, which reduces computation and limits the influence of non-speech segments on the localization result. Second, the speech segments are filtered to mitigate the effect of a low signal-to-noise ratio at the sound source. Finally, the GCC-PHAT method is used to localize the sound source quickly. Experimental results show that the method localizes well under low signal-to-noise ratios.
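The abstract names GCC-PHAT as the final localization step but gives no implementation details. The following is a minimal sketch of the standard GCC-PHAT time-delay estimate for one microphone pair, not the authors' code. The function names `gcc_phat` and `tdoa_to_angle`, the two-microphone far-field geometry, the microphone spacing, and the speed of sound of 343 m/s are all assumptions made for illustration.

```python
import numpy as np


def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (in seconds) of `sig` relative to `ref` with GCC-PHAT.

    A positive result means `sig` arrives later than `ref`.
    """
    # Zero-pad to the combined length so the circular correlation
    # behaves like a linear one.
    n = len(sig) + len(ref)
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)

    # Cross-power spectrum with the phase transform (PHAT) weighting:
    # dividing by the magnitude keeps only phase information.
    cross = SIG * np.conj(REF)
    cross /= np.abs(cross) + 1e-12  # small constant avoids division by zero
    cc = np.fft.irfft(cross, n=n)

    # Restrict the search to physically plausible lags when max_tau is given.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / float(fs)


def tdoa_to_angle(tau, mic_distance, speed_of_sound=343.0):
    """Convert a delay to a far-field arrival angle in degrees (assumed geometry)."""
    # Clip to [-1, 1] so estimation noise cannot push arcsin out of range.
    ratio = np.clip(speed_of_sound * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))


if __name__ == "__main__":
    fs = 16000
    rng = np.random.default_rng(0)
    ref = rng.standard_normal(1024)
    delay = 5  # samples
    sig = np.concatenate((np.zeros(delay), ref[:-delay]))

    tau = gcc_phat(sig, ref, fs)
    print("estimated delay in samples:", round(tau * fs))
    print("estimated angle in degrees:", tdoa_to_angle(tau, mic_distance=0.2))
```

In a pipeline like the one the abstract describes, such a function would presumably be applied only to the segments that VAD marks as speech and that have already passed through the DNN filter. The PHAT weighting discards magnitude and keeps phase, which sharpens the correlation peak.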