The NPU-Elevoc Personalized Speech Enhancement System for ICASSP2023 DNS Challenge

Xiaopeng Yan,Yindi Yang,Zhihao Guo,Liangliang Peng,Lei Xie
DOI: https://doi.org/10.48550/arXiv.2303.06811
2023-03-13
Audio and Speech Processing
Abstract:This paper describes our NPU-Elevoc personalized speech enhancement system (NAPSE) for the 5th Deep Noise Suppression Challenge at ICASSP 2023. Based on the superior two-stage model TEA-PSE 2.0, our system particularly explores better strategy for speaker embedding fusion, optimizes the model training pipeline, and leverages adversarial training and multi-scale loss. According to the results, our system is tied for the 1st place in the headset track (track 1) and ranked 2nd in the speakerphone track (track 2).
What problem does this paper attempt to address?