Abstract:<p>Background noise and room reverberation often reduce the reliability of binaural cues and degrade speech quality, especially in non-stationary environments. To address these problems, we propose a novel speech separation algorithm for noisy-reverberant environments, based on a two-stage neural network model and a special separation mask. Firstly, the first-stage neural network derives a weight matrix that is used to construct reliable binaural cues. The reliable binaural cues, combined with complementary spectral features, are used as the input of the separation DNN. Secondly, a special separation mask is introduced for noisy-reverberant environments, which can suppress background noise and reduce reverberation. Thirdly, the separation DNN is used as a nonlinear function to estimate the separation mask. The two-stage neural network system is then trained jointly. During joint training, the system adaptively adjusts the weight matrix according to the final error, which is similar to an attention mechanism applied to the binaural features. At the same time, because the binaural cues are more reliable, the neural networks can make better use of the effective information. Finally, the estimated separation mask is used to weight the noisy-reverberant speech to obtain the enhanced speech. Experimental results indicate that the proposed algorithm achieves better performance than the comparison algorithms in different scenarios with various amounts of noise and reverberation.</p>
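The data flow described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the element-wise weighting, the sigmoid-bounded weights and mask, the random linear layer standing in for the separation DNN, and all names and dimensions (`weight_binaural_cues`, `build_separation_input`, `apply_mask`, `cue_dim`, and so on) are assumptions made purely for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def weight_binaural_cues(cues, weights):
    # Assumption: the weight matrix scales each binaural cue element-wise.
    return cues * weights

def build_separation_input(weighted_cues, spectral_features):
    # Concatenate the weighted cues and spectral features per frame.
    return np.concatenate([weighted_cues, spectral_features], axis=-1)

def apply_mask(mask, mixture_mag):
    # Weight the noisy-reverberant magnitude spectrogram with the mask.
    return mask * mixture_mag

rng = np.random.default_rng(0)
frames, cue_dim, spec_dim, bins = 4, 6, 5, 8  # hypothetical sizes

cues = rng.standard_normal((frames, cue_dim))        # placeholder binaural cues
spectral = rng.standard_normal((frames, spec_dim))   # placeholder spectral features
mixture_mag = np.abs(rng.standard_normal((frames, bins)))

# Stand-in for the first-stage network output: weights in (0, 1).
weights = sigmoid(rng.standard_normal((frames, cue_dim)))

x = build_separation_input(weight_binaural_cues(cues, weights), spectral)

# Stand-in for the separation DNN: one random linear layer plus sigmoid.
layer = rng.standard_normal((cue_dim + spec_dim, bins))
mask = sigmoid(x @ layer)

enhanced = apply_mask(mask, mixture_mag)
```

In the system the abstract describes, both stand-ins would be trained networks optimized jointly, so that the error on the final mask also drives the weights applied to the binaural cues.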

Neural-Based Separating Method for Nonlinear Mixtures

Nonlinear Blind Separation Using an RBF Network Model

Extraction Of Unique Independent Components For Nonlinear Mixture Of Sources

Nonlinear blind source separation using a genetic algorithm

Hopfield Neural Network Approach for Supervised Nonlinear Spectral Unmixing

Blind Source Separation of Nonlinearly Constrained Mixed Sources Using Polynomial Series Reversion

Neural Network Approaches to Nonlinear Blind Source Separation

Weierstrass Approach to Blind Source Separation of Multiple Nonlinearly Mixed Signals

Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation

Blind Source Separation Based on Self-Organizing Neural Network

End-to-end Networks for Supervised Single-channel Speech Separation

Learning nonlinear manifolds based on mixtures of localized linear manifolds under a self-organizing framework

Blind Separation of Noisy Mixed Speech Based on Independent Component Analysis and Neural Network

An MRF-ICA based algorithm for image separation

Blind Source Separation Using Mixtures of Alpha-Stable Distributions

A Neural Network Alternative to Non-Negative Audio Models

Speech separation based on reliable binaural cues with two-stage neural network in noisy-reverberant environments

Mutual Information Based Approach for Nonnegative Independent Component Analysis

Bilinear Mixture Models Based Unsupervised Nonlinear Unmixing Using Constrained Nonnegative Matrix Factorization

Latent Iterative Refinement for Modular Source Separation

Sound Source Separation Using Latent Variational Block-Wise Disentanglement