Amplitude Consistent Enhancement for Speech Dereverberation.

Chunlei Liu, Longbiao Wang, Jianwu Dang
DOI: https://doi.org/10.1145/3404555.3404618
2020-01-01
Abstract: Mapping and masking methods based on deep learning are currently the two main approaches to speech dereverberation. Both typically enhance the amplitude of the reverberant speech while leaving the reverberant phase unprocessed, and the target speech is then synthesized from the enhanced amplitude and the reverberant phase. However, because overlapping frames interfere with each other during the superposition process (overlap-and-add), the final synthesized speech signal deviates from its ideal value. In this paper, we propose an amplitude consistent enhancement method (ACE) to solve this problem. When training deep neural networks (DNNs) with ACE, we use the difference between the amplitudes of the synthesized speech and the clean speech as the loss function. We also propose adding an adjustment layer to improve the regression accuracy of the DNN. Speech dereverberation experiments show that the proposed method improves PESQ and SNR by 5% and 15%, respectively, compared with the traditional signal approximation method.
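The difference between the two loss functions named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the Hann window, the frame length and hop, the mean squared error, and the toy impulse response are all assumptions made for illustration. The sketch pairs an amplitude with the reverberant phase, synthesizes a waveform by overlap-and-add, and measures the amplitude of the synthesized signal against the clean amplitude.

```python
import numpy as np

N_FFT, HOP = 256, 64          # assumed frame length and hop (75% overlap)
WINDOW = np.hanning(N_FFT)    # assumed analysis/synthesis window


def stft(x):
    """Windowed short-time Fourier transform, shape (frames, bins)."""
    n_frames = 1 + (len(x) - N_FFT) // HOP
    frames = np.stack([x[i * HOP:i * HOP + N_FFT] * WINDOW
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)


def istft(spec):
    """Weighted overlap-and-add synthesis: overlapping frames are summed."""
    frames = np.fft.irfft(spec, n=N_FFT, axis=1)
    length = N_FFT + HOP * (len(frames) - 1)
    out, norm = np.zeros(length), np.zeros(length)
    for i, frame in enumerate(frames):
        out[i * HOP:i * HOP + N_FFT] += frame * WINDOW
        norm[i * HOP:i * HOP + N_FFT] += WINDOW ** 2
    return out / np.maximum(norm, 1e-8)


def signal_approximation_loss(enhanced_mag, clean_mag):
    """Conventional loss: compare the enhanced amplitude directly."""
    return float(np.mean((enhanced_mag - clean_mag) ** 2))


def amplitude_consistent_loss(enhanced_mag, reverb_phase, clean_mag):
    """Hypothetical ACE-style loss: compare the amplitude after synthesis."""
    synthesized = istft(enhanced_mag * np.exp(1j * reverb_phase))
    return float(np.mean((np.abs(stft(synthesized)) - clean_mag) ** 2))


# Toy data: white noise as "clean speech", a decaying random impulse response.
rng = np.random.default_rng(0)
clean = rng.standard_normal(N_FFT + HOP * 40)
rir = rng.standard_normal(400) * np.exp(-np.arange(400) / 80.0)
reverb = np.convolve(clean, rir)[:len(clean)]

clean_mag = np.abs(stft(clean))
reverb_phase = np.angle(stft(reverb))

# Pretend the network predicted the clean amplitude perfectly.
sa = signal_approximation_loss(clean_mag, clean_mag)
ace = amplitude_consistent_loss(clean_mag, reverb_phase, clean_mag)
print(f"signal approximation loss: {sa:.4f}")
print(f"amplitude consistent loss: {ace:.4f}")
```

In this demo the enhanced amplitude is set equal to the clean amplitude, so the signal approximation loss is zero, while the loss measured after synthesis should remain positive because the clean amplitude combined with the reverberant phase is generally not a consistent spectrogram. Training a DNN with such a loss would additionally require a differentiable STFT and inverse STFT in a deep learning framework, which this sketch does not cover.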