Abstract:Existing speech source separation approaches overwhelmingly rely on acoustic pressure information acquired by using a microphone array. Little attention has been devoted to the usage of B-format microphones, by which both acoustic pressure and pressure gradient can be obtained, and therefore the direction of arrival (DOA) cues can be estimated from the received signal. In this paper, such DOA cues, together with the frequency bin-wise mixing vector (MV) cues, are used to evaluate the contribution of a specific source at each time frequency (T-F) point of the mixtures in order to separate the source from the mixture. Based on the von Mises mixture model and the complex Gaussian mixture model respectively, a source separation algorithm is developed, where the model parameters are estimated via an expectation-maximization (EM) algorithm. A T-F mask is then derived from the model parameters for recovering the sources. Moreover, we further improve the separation performance by choosing only the reliable DOA estimates at the T-F units based on thresholding. The performance of the proposed method is evaluated in both simulated room environments and a real reverberant studio in terms of signal-to-distortion ratio (SDR) and the perceptual evaluation of speech quality (PESQ). The experimental results show its advantage over four baseline algorithms including three T-F mask based approaches and one convolutive independent component analysis (ICA) based method. (C) 2015 Elsevier B.V. All rights reserved.

An Improved BLUES with Adaptive Threshold of Condition Number for Separating Underdetermined Speech Mixtures

Reverberant Speech Separation with Probabilistic Time-Frequency Masking for B-format Recordings.

A Blind Separation Algorithm of Speech Mixtures Base on Time-Frequency Masking

Cepstral Smoothing of Spectral Masks for Acoustic Vector-Sensor Based Convolutive Speech Separation

Research of adaptive speech separation method based on speech status detection

An Adaptive Single Channel EMD-TNMF Blind Source Separation Algorithm for Both Instantaneous and Convolutive Mixed Signal

Improved Source Counting and Separation for Monaural Mixture

Research on underdetermined speech blind separation based on attenuation and time-delay clustering estimation

Underdetermined Convolutive Blind Separation of Sources Integrating Tensor Factorization and Expectation Maximization.

Quasi-Blind Source Separation Algorithm for Convolutive Mixture of Speech

Underdetermined Reverberant Audio-Source Separation Through Improved Expectation–Maximization Algorithm

Underdetermined Blind Separation of Overlapped Speech Mixtures in Time-Frequency Domain with Estimated Number of Sources

Underdetermined Blind Source Separation of Speech Mixtures Based on K-means Clustering

A Novel Approach For Underdetermined Blind Speech Sources Separation

The Improved Method for Solving Permutation Problem in Frequency Domain Blind Source Separation of Speech Signals

Adaptive Speech Separation Based on Beamforming and Frequency Domain-Independent Component Analysis

Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers

Underdetermined Blind Separation of Delayed Sound Source in the Time-frequency Domain

Improving Separation of Harmonic Sources with Iterative Estimation of Spatial Cues

Room Impulse Response Reshaping-Based Underdetermined Blind Source Separation in a Reverberant Environment

Underdetermined Blind Separation Based on Sound Source Time-Delay Estimation