A Loss with Mixed Penalty for Speech Enhancement Generative Adversarial Network

Jie Cao,Yaofeng Zhou,Hong Yu,Xiaoxu Li,Dan Wang,Zhanyu Ma
DOI: https://doi.org/10.1109/apsipaasc47483.2019.9023273
2019-01-01
Abstract:Speech enhancement based on generative adversarial networks (GANs) can overcome the problems of many classical speech enhancement methods, such as relying on the first-order statistics of signals and ignoring the phase mismatch between the noisy and the clean signals. However, GANs are hard to train and have the vanishing gradients problem which may lead to generate poor samples. In this paper, we propose a relativistic average least squares loss function with a mixed penalty term for speech enhancement generative adversarial network. The mixed penalty term can minimize the distance between generated and clean samples more effectively. Experimental results on Valentini 2016 and Valentini 2017 dataset show that the proposed loss can make the training of GAN more stable, and achieves good performance in both objective and subjective evaluation.
What problem does this paper attempt to address?