Application of improved U-Net network with attention mechanism in end-to-end speech enhancement

WU Ruiqin,CHEN Xueqin,YU Jie,WANG Lirong,ZHAO Heming
2022-01-01
Abstract:An improved U-Net(Attention Dilated Convolution U-Net,ADC-U-Net)network model for end-to-end speech enhancement is designed based on the U-Net network.Compared with the baseline U-Net network,the dilated convolution is added to reduce the loss of infor-mation caused by sampling.Besides,the attention mechanism structure is introduced,which combines more contextual information of noisy speech to extract deeper and richer feature in-formation.The proposed model avoids the extraction of features with distinct step,so it does not need three steps of traditional methods,including feature extraction,feature denoising and waveform reconstruction.The proposed network model obtains complex structural features to represent speech through multi-level and multi-scale learning.The quality and intelligibility of enhanced speech are evaluated by several subjective and objective indexes.Experimental data show that the proposed algorithm performs well in noise suppression and adaptability,and has advantage over baseline U-Net network and other models in speech quality and intelligibility.
What problem does this paper attempt to address?