Exploiting the directional coherence function for multichannel source extraction
Shan Liang, Guanjun Li, Shuai Nie, ZhanLei Yang, WenJu Liu, Jianhua Tao
DOI: https://doi.org/10.1016/j.specom.2021.01.002
IF: 2.723
2021-01-01
Speech Communication
Abstract: The desired speech detector plays an important role in controlling speech distortion in spatial-filtering-based speech enhancement algorithms. However, conventional complex coherence (CC) based algorithms can only distinguish coherent speech from diffuse noise. To improve performance in scenarios where both coherent interference and diffuse noise are present, we propose a detector based on the directional coherence function (DCF). Given a pair of complementary filters, one suppressing the diffuse noise and the other suppressing the coherent interference, the DCF is computed as the normalized correlation between the filters' outputs. The filters are obtained by convex programming and satisfy constraints on distortionless response for the desired speech and on white noise gain (WNG). Consequently, the DCF is close to 1 only for time-frequency (T-F) bins dominated by the desired speech, and much smaller than 1 for T-F bins dominated by noise or interference. To extract the desired speech, the DCF-based desired speech presence probability (DSPP) is used to control the adaptation of the generalized sidelobe canceller (GSC) and subsequently serves as the post-filtering weight. Systematic experiments on several scenarios show that the proposed algorithm achieves significantly and consistently better noise suppression than algorithms based on narrowband direction-of-arrival (DOA) estimates.
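As a rough illustration of the "normalized correlation between the filters' outputs", the sketch below computes a coherence-like quantity for a single frequency bin from two sequences of complex filter outputs. The abstract does not give the exact DCF definition, so the magnitude-based form, the recursive smoothing, and all names here (`directional_coherence`, `y_a`, `y_b`, `alpha`, `eps`) are assumptions made for illustration only, not the authors' formulation.

```python
def directional_coherence(y_a, y_b, alpha=0.8, eps=1e-12):
    """Coherence-like normalized correlation for one frequency bin.

    y_a, y_b: per-frame complex outputs of the two complementary
              filters (hypothetical inputs, one value per frame).
    alpha:    recursive smoothing factor (an assumption, not from the paper).
    Returns one value in [0, 1] per frame.
    """
    p_aa = 0.0   # smoothed power of the first filter output
    p_bb = 0.0   # smoothed power of the second filter output
    p_ab = 0j    # smoothed cross-correlation between the outputs
    values = []
    for a, b in zip(y_a, y_b):
        p_aa = alpha * p_aa + (1 - alpha) * abs(a) ** 2
        p_bb = alpha * p_bb + (1 - alpha) * abs(b) ** 2
        p_ab = alpha * p_ab + (1 - alpha) * a * b.conjugate()
        # Cauchy-Schwarz keeps the ratio at or below 1; eps avoids 0/0.
        values.append(abs(p_ab) / ((p_aa * p_bb) ** 0.5 + eps))
    return values


# Identical outputs (both filters pass the same component unchanged).
same = directional_coherence([1 + 1j] * 50, [1 + 1j] * 50)

# Outputs whose relative sign flips every frame (poorly correlated).
flipping = [(-1) ** k + 0j for k in range(50)]
alt = directional_coherence([1 + 0j] * 50, flipping)
```

In this toy setting, `same` stays close to 1 while `alt` settles at a much smaller value, which mirrors the qualitative behaviour the abstract describes for bins dominated by the desired speech versus bins dominated by noise or interference. Note that with recursive smoothing the very first frame always yields a value near 1, so a practical detector would need some initialization or warm-up handling.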