Abstract:Aiming at solving the problems of the conventional minimum variance distortionless response (MVDR) beamformer in practical applications, such as the sensibility of the steering vector mismatch and beampattern distortion, a robust broadband MVDR beamforming method with low-latency by reconstructing covariance matrix is proposed and applied to speech enhancement with a linear microphone array in this paper. In this work, some important steps are optimized, and the main contribution is to consider the problem of correlation terms generated by the low latency. Firstly, the direction of arrival (DOA) is corrected and the steering vector is estimated based on the sparsity of the DOAs corresponding to the sound sources, which improves the ability of anti-mismatches in the steering vector. Secondly, the correlation terms between the sound sources and noise are estimated and eliminated by the Capon power within the eigen-subspace, and the indirect dominant method is used to eliminate the correlation terms between the sound sources, so that the covariance matrix is reconstructed to obtain a more robust MVDR beamformer. Thirdly, the problem of white noise amplification at low frequency bins is analyzed, and a white noise gain (WNG) modification method is proposed to obtain a compromise between the interference suppression and WNG. In the experiments, the TIMIT corpus is used to generate the multi-channel speech data set, and the performance of the proposed method is evaluated with different DOAs and input signal to interference plus noise ratios (SINRs). The experimental results show that the proposed method can effectively suppress the interferences and reduce the noise with strong robustness.

Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter.

Attention-Based Beamformer For Multi-Channel Speech Enhancement

Neural Spatio-Temporal Beamformer for Target Speech Separation

Design of a robust MVDR beamforming method with Low-Latency by reconstructing covariance matrix for speech enhancement

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Masking-based Neural Beamformer for Multichannel Speech Enhancement

Multichannel Speech Enhancement without Beamforming

Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection

ADL-MVDR: All deep learning MVDR beamformer for target speech separation

Adaptive Beamforming Based on Interference-Plus-Noise Covariance Matrix Reconstruction for Speech Separation

Beamforming and Lightweight GRU Neural Networkcombination Model for Multi-Channel Speech Enhancement

Cnn-Based Virtual Microphone Signal Estimation For Mpdr Beamforming In Underdetermined Situations

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation

Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction

Deep Interaction between Masking and Mapping Targets for Single-Channel Speech Enhancement

Robust Beamforming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation.

Modified Complementary Joint Sparse Representations: A Novel Post-Filtering to MVDR Beamforming.

Supervised Single-Channel Speech Enhancement Using Ratio Mask with Joint Dictionary Learning

Unsupervised Improved MVDR Beamforming for Sound Enhancement

A New Neural Beamformer for Multi-channel Speech Separation

Multi-channel Speech Enhancement based on Beamforming and GAN Network