Abstract:A blind dereverberation method based on power spectral subtraction (SS) using a multi-channel least mean squares algorithm was previously proposed to suppress the reverberant speech without additive noise. The results of isolated word speech recognition experiments showed that this method achieved significant improvements over conventional cepstral mean normalization (CMN) in a reverberant environment. In this paper, we propose a blind dereverberation method based on generalized spectral subtraction (GSS), which has been shown to be effective for noise reduction, instead of power SS. Furthermore, we extend the missing feature theory (MFT), which was initially proposed to enhance the robustness of additive noise, to dereverberation. A one-stage dereverberation and denoising method based on GSS is presented to simultaneously suppress both the additive noise and nonstationary multiplicative noise (reverberation). The proposed dereverberation method based on GSS with MFT is evaluated on a large vocabulary continuous speech recognition task. When the additive noise was absent, the dereverberation method based on GSS with MFT using only 2 microphones achieves a relative word error reduction rate of 11.4 and 32.6% compared to the dereverberation method based on power SS and the conventional CMN, respectively. For the reverberant and noisy speech, the dereverberation and denoising method based on GSS achieves a relative word error reduction rate of 12.8% compared to the conventional CMN with GSS-based additive noise reduction method. We also analyze the effective factors of the compensation parameter estimation for the dereverberation method based on SS, such as the number of channels (the number of microphones), the length of reverberation to be suppressed, and the length of the utterance used for parameter estimation. The experimental results showed that the SS-based method is robust in a variety of reverberant environments for both isolated and continuous speech recognition and under various parameter estimation conditions.

Supervised Single-Channel Speech Dereverberation And Denoising Using A Two-Stage Processing

Supervised Single-Channel Speech Dereverberation and Denoising Using a Two-Stage Model Based Sparse Representation.

Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering

A new method of speech dereverberation of single channel with cepstral processing

Know Your Enemy, Know Yourself: A Unified Two-Stage Framework for Speech Enhancement

Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization

Single Channel Speech Dereverberation and Separation Using RPCA and SNMF

Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation

Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations

Frequency-domain Dereverberation on Speech Signal Using Surround Retinex

Simultaneous Denoising and Dereverberation Using Deep Embedding Features

Blind MultiChannel Identification and Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function

Complex cepstrum based single channel speech dereverberation

Bifurcation and Reunion: A Loss-Guided Two-Stage Approach for Monaural Speech Dereverberation

Speech Dereverberation of Single Channel

Dereverberation based on Minimum Phase Decomposition

Monaural Speech Dereverberation using Deformable Convolutional Networks

Densely Connected Multi-Stage Model with Channel Wise Subband Feature for Real-Time Speech Enhancement.

A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices

Audio-Visual Speech Separation and Dereverberation With a Two-Stage Multimodal Network

Dereverberation and Denoising Based on Generalized Spectral Subtraction by Multi-Channel Lms Algorithm Using A Small-Scale Microphone Array