Redundant Convolutional Network with Attention Mechanism for Monaural Speech Enhancement.

Tian Lan,Yilan Lyu,Guoqiang Hui,Refuoe Mokhosi,Sen Li,Qiao Liu
DOI: https://doi.org/10.1109/icassp40776.2020.9053277
2020-01-01
Abstract:The redundant convolutional encoder-decoder network has proven useful in speech enhancement tasks. It can capture localized time-frequency details of speech signals through both the fully convolutional network structure and feature selection capability resulting from the encoder-decoder mechanism. However, it does not explicitly consider the signal filtering mechanism, which we regard as important for speech enhancement models. In this study, we introduce an attention mechanism into the convolutional encoder-decoder model. This mechanism adaptively filters channel-wise feature responses by explicitly modeling attentions (on speech versus noise signals) between channels. Experimental results show that the proposed attention model is effective in capturing speech signals from background noise, and performs especially better in unseen noise conditions compared to other state-of-the-art models.
What problem does this paper attempt to address?