Data augmentation using generative adversarial networks for robust speech recognition.

Yanmin Qian,Hu Hu,Tian Tan
DOI: https://doi.org/10.1016/j.specom.2019.08.006
IF: 2.723
2019-01-01
Speech Communication
Abstract:•This paper utilizes three different GANs for data augmentation to improve speech recognition under noise conditions.•The experiments show that out proposed data augmentation approaches can obtain the performance improvement under all noisy conditions, which have additive noise, channel distortion and reverberation.•With the proposed approach, we can use GAN to generate more training data under noisy conditions, which can be used in multi-condition training of acoustic modeling in robust speech recognition.
What problem does this paper attempt to address?