Dual-Channel Speech Separation Using Interaural Time Difference with Generalized Gaussian Mixture Model

Zhaogui Ding,Liming Zhang,Longbiao Wang,Weifeng Li
DOI: https://doi.org/10.2991/icitmi-15.2015.194
2015-01-01
Abstract:In this letter we present a novel speech separation scheme using two microphones. The proposed method utilizes the estimation of interaural time difference (ITD) statistics for the separation of mixed speech sources. The novelties of this paper consist in the use of Generalized Gaussian Mixture Model (GGMM) for speech separation frame by frame and cross-correlation coefficient for distributed parameter selection. The proposed model can be extended to audio enhancement. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed methods and show significant quality improvements over the conventional dual ITD based methods.
What problem does this paper attempt to address?