Abstract:Aiming at solving the problems of the conventional minimum variance distortionless response (MVDR) beamformer in practical applications, such as the sensibility of the steering vector mismatch and beampattern distortion, a robust broadband MVDR beamforming method with low-latency by reconstructing covariance matrix is proposed and applied to speech enhancement with a linear microphone array in this paper. In this work, some important steps are optimized, and the main contribution is to consider the problem of correlation terms generated by the low latency. Firstly, the direction of arrival (DOA) is corrected and the steering vector is estimated based on the sparsity of the DOAs corresponding to the sound sources, which improves the ability of anti-mismatches in the steering vector. Secondly, the correlation terms between the sound sources and noise are estimated and eliminated by the Capon power within the eigen-subspace, and the indirect dominant method is used to eliminate the correlation terms between the sound sources, so that the covariance matrix is reconstructed to obtain a more robust MVDR beamformer. Thirdly, the problem of white noise amplification at low frequency bins is analyzed, and a white noise gain (WNG) modification method is proposed to obtain a compromise between the interference suppression and WNG. In the experiments, the TIMIT corpus is used to generate the multi-channel speech data set, and the performance of the proposed method is evaluated with different DOAs and input signal to interference plus noise ratios (SINRs). The experimental results show that the proposed method can effectively suppress the interferences and reduce the noise with strong robustness.

A Study of Learning Based Beamforming Methods for Speech Recognition

Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition

Enhancing Mmwave Beam Prediction Through Deep Learning with Sub-6 GHz Channel Estimate

Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction

Deep Learning Based Speech Beamforming

Robust mmWave Beamforming by Self-Supervised Hybrid Deep Learning

Masking-based Neural Beamformer for Multichannel Speech Enhancement

A New Neural Beamformer for Multi-channel Speech Separation

A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model

A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking

Attention-Based Beamformer For Multi-Channel Speech Enhancement

Benefits of triple acoustic beamforming during speech-on-speech masking and sound localization for bilateral cochlear-implant users

Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation

A Deep Learning Approach to Location- and Orientation-aided 3D Beam Selection for mmWave Communications

Online/Offline Learning to Enable Robust Beamforming: Limited Feedback Meets Deep Generative Models

Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting.

Position-Aware Beam Training for Near-Field Milimeter-Wave XL-MIMO Communications

Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function

Beam Profiling and Beamforming Modeling for mmWave NextG Networks

Design of a robust MVDR beamforming method with Low-Latency by reconstructing covariance matrix for speech enhancement

An Iterative Mask Estimation Approach to Deep Learning Based Multi-Channel Speech Recognition