IDANet: An Information Distillation and Aggregation Network for Speech Enhancement

Wenxin Tai, Tian Lan, Qianhui Wang, Qiao Liu
DOI: https://doi.org/10.1109/LSP.2021.3114122
2021-01-01
IEEE Signal Processing Letters
Abstract: Speech enhancement aims to recover clean speech from recordings made in noisy environments. In recent years, skip connections have shown great promise in improving speech enhancement performance. Although directly transmitting low-level information helps reconstruct the spectrum, the noise components carried along with it degrade the denoising results. In this letter, we propose IDANet, an end-to-end framework that incorporates an information distillation and aggregation unit to store fine-grained features and filter out noisy components through continuous distillation and recalibration. In addition, we design a novel decoding block equipped with deformable convolution and a dynamic attention mechanism to further improve the capability of the reconstruction unit. Experimental results on the TIMIT corpus demonstrate that the proposed IDANet is both efficient and effective: it uses 0.68 M parameters versus 1.23 M for the state-of-the-art model, while improving STOI and PESQ by 0.81% and 3.73%, respectively.
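As a quick check of the efficiency claim, the relative parameter saving can be worked out from the two counts quoted in the abstract. The snippet below is only an illustrative calculation: the helper name `relative_reduction` and the variable names are assumptions made for this example, not anything taken from the paper. It does not recompute the STOI or PESQ gains, since the abstract does not give the underlying scores.

```python
def relative_reduction(ours: float, baseline: float) -> float:
    """Return the fraction by which `ours` is smaller than `baseline`."""
    return 1.0 - ours / baseline


# Parameter counts in millions, as quoted in the abstract.
idanet_params_m = 0.68
baseline_params_m = 1.23

reduction = relative_reduction(idanet_params_m, baseline_params_m)
print(f"Relative parameter reduction: {reduction:.1%}")  # prints 44.7%
```

In other words, the reported model is a little over half the size of the baseline it is compared against.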