Convolutional Recurrent MetricGAN with Spectral Dimension Compression for Full-Band Speech Enhancement

Zhongshu Hou, Qinwen Hu, Tianchi Sun, Yuxiang Hu, Changbao Zhu, Kai Chen
DOI: https://doi.org/10.1109/icassp49357.2023.10095906
2023-01-01
Abstract: MetricGAN and its variants have proven to be effective wide-band speech enhancement models. In this paper, we extend MetricGAN to full-band enhancement by combining it with our recently proposed learnable spectral dimension compression mapping strategy. An encoder-decoder structure with a time-frequency convolutional recurrent network is used as the generator. The proposed model was submitted to the ICASSP Signal Processing Grand Challenge: DNS-5 Challenge (2023). Without using the enrollment speech, it obtains a final score of 0.548 on Track-1 and 0.559 on Track-2.