Two-Microphones Speech Separation Using Generalized Gaussian Mixture Model

Miao Fan,Jia-Min Mao,Jao-Gui Ding,Wei-Feng Li
DOI: https://doi.org/10.1515/9783110584974-040
2017-01-01
Abstract:In this paper we present a novel spatial speech separation scheme by using two microphones. The technique utilizes the estimation of interaural time difference (ITD) statistics for the separation of mixed speech sources. The novelties of the paper consist in the use of Generalized Gaussian Mixture Model (GGMM) for speech separation frame by frame and cross- correlation coefficient for distributed parameter selection. These are done frame-by-frame, which provides a dynamically changing time-frequency masking. The proposed model can be extended to audio enhancement. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed methods and show significant quality improvements over the conventional ICA and dual ITD based methods.
What problem does this paper attempt to address?