Combination of Smoothing Filter and Subband Processing for Double Simultaneous Speaker Localization

Li-xia HUANG,Dan-fei ZAN,Sui-sui ZHANG,Xue-ying ZHANG
DOI: https://doi.org/10.3969/j.issn.1006-9348.2018.05.077
2018-01-01
Abstract:In order to improve the performance of multiple source localization in reverberant situations,we propose a localization method for multiple speakers with smoothing generalized cross correlation (SGCC) based on sub-band processing.The method takes the advantage of sparseness of speech signal in time-frequency domain.Firstly,the speech signal was divided into different sub-bands,and the smoothing generalized cross correlation was calculated in each sub-band.Then,the weighted average method was used to fuse the time delay estimation of each sub-band.Finally,the three-dimensional position of each sound source was obtained by geometric localization algorithm.The first-order smoothing filter was used to perform multi-frame weighted smoothing on the cross-power spectrum function,eliminating fluctuations only estimated by the current frame.The simulation results show that proposed algorithm achieves high localization accuracy in the reverberant environment and the algorithm is superior to the traditional algorithm.
What problem does this paper attempt to address?