DFBNet: Deep Neural Network Based Fixed Beamformer for Multi-channel Speech Separation

Ruqiao Liu, Yi Zhou, Hongqing Liu, Xinmeng Xu, Jie Jia, Binbin Chen
DOI: https://doi.org/10.1109/sips52927.2021.00042
2021-01-01
Abstract: Deep neural network (DNN) based beamformers have achieved significant improvements in speech separation tasks. This paper proposes a novel DNN-based fixed beamformer (DFBNet) that uniformly samples the space as a learning module. The initial coefficients of the fixed beamformers in DFBNet are determined by an existing superdirective beamformer. To obtain the beams that are related to each speaker, the proposed model introduces a speech source estimation model, dual-path RNN (DPRNN), and an attention mechanism. Experimental results show that, in a separation task with reverberation, the proposed method outperforms both DPRNN and the filter-and-sum network (FasNet), currently the state-of-the-art temporal neural beamformer, on scale-invariant signal-to-noise ratio (SI-SNR) and perceptual evaluation of speech quality (PESQ).
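For readers unfamiliar with the SI-SNR metric named in the abstract, the following is a minimal sketch of its commonly used definition: both signals are mean-centered, the estimate is projected onto the reference to obtain a target component, and the ratio of target energy to residual energy is reported in dB. The function name `si_snr` and the `eps` stabilizer are illustrative assumptions; the abstract does not specify the exact evaluation code the authors used.

```python
import math


def si_snr(estimate, reference, eps=1e-8):
    """Scale-invariant SNR in dB between two equal-length 1-D signals.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(estimate) != len(reference):
        raise ValueError("signals must have the same length")
    n = len(reference)

    # Remove the mean so that a DC offset does not affect the score.
    est_mean = sum(estimate) / n
    ref_mean = sum(reference) / n
    est = [x - est_mean for x in estimate]
    ref = [x - ref_mean for x in reference]

    # Project the estimate onto the reference to get the target component.
    dot = sum(e * r for e, r in zip(est, ref))
    ref_energy = sum(r * r for r in ref) + eps
    scale = dot / ref_energy
    target = [scale * r for r in ref]

    # Whatever the projection does not explain is treated as noise.
    noise = [e - t for e, t in zip(est, target)]

    target_energy = sum(t * t for t in target)
    noise_energy = sum(v * v for v in noise)
    return 10.0 * math.log10((target_energy + eps) / (noise_energy + eps))
```

Because the target is obtained by projection, rescaling the estimate leaves the score essentially unchanged, which is why the metric is called scale-invariant. A higher value indicates a cleaner separation.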