Abstract:Aiming at solving the problems of the conventional minimum variance distortionless response (MVDR) beamformer in practical applications, such as the sensibility of the steering vector mismatch and beampattern distortion, a robust broadband MVDR beamforming method with low-latency by reconstructing covariance matrix is proposed and applied to speech enhancement with a linear microphone array in this paper. In this work, some important steps are optimized, and the main contribution is to consider the problem of correlation terms generated by the low latency. Firstly, the direction of arrival (DOA) is corrected and the steering vector is estimated based on the sparsity of the DOAs corresponding to the sound sources, which improves the ability of anti-mismatches in the steering vector. Secondly, the correlation terms between the sound sources and noise are estimated and eliminated by the Capon power within the eigen-subspace, and the indirect dominant method is used to eliminate the correlation terms between the sound sources, so that the covariance matrix is reconstructed to obtain a more robust MVDR beamformer. Thirdly, the problem of white noise amplification at low frequency bins is analyzed, and a white noise gain (WNG) modification method is proposed to obtain a compromise between the interference suppression and WNG. In the experiments, the TIMIT corpus is used to generate the multi-channel speech data set, and the performance of the proposed method is evaluated with different DOAs and input signal to interference plus noise ratios (SINRs). The experimental results show that the proposed method can effectively suppress the interferences and reduce the noise with strong robustness.

Speech Enhancement Integrating the MVDR Beamforming and T-F Masking

Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter.

Beamforming-based Speech Enhancement Based on Optimal Ratio Mask

An Implementaion of the CNN-Based MVDR Beamforming For Speech Enhancement.

Attention-Based Beamformer For Multi-Channel Speech Enhancement

Design of a robust MVDR beamforming method with Low-Latency by reconstructing covariance matrix for speech enhancement

Masking-based Neural Beamformer for Multichannel Speech Enhancement

Neural Spatio-Temporal Beamformer for Target Speech Separation

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Single-channel speech enhancement using improved progressive deep neural network and masking-based harmonic regeneration

ADL-MVDR: All deep learning MVDR beamformer for target speech separation

Masks Fusion with Multi-Target Learning For Speech Enhancement

Deep Interaction between Masking and Mapping Targets for Single-Channel Speech Enhancement

Directional Gain Based Noise Covariance Matrix Estimation for MVDR Beamforming

A Speech Enhancement Algorithm By Iterating Single- And Multi-Microphone Processing And Its Application To Robust Asr

Robust Beamforming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation.

Joint Training Of Complex Ratio Mask Based Beamformer And Acoustic Model For Noise Robust Asr

Iteratively Refined Multi-Channel Speech Separation

Multi-resolution Auditory Cepstral Coefficient and Adaptive Mask for Speech Enhancement with Deep Neural Network

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation

Distortionless Multi-Channel Target Speech Enhancement for Overlapped Speech Recognition