Abstract: The large amount of randomly generated noise in mobile networks means that conventional speech enhancement lacks both targeting and an adversarial (game) process, and enhancement based on acoustic features alone has major drawbacks. This paper proposes a single-channel speech quality enhancement method for mobile networks based on generative adversarial networks (GANs). It first explains the principle by which a GAN performs single-channel speech quality enhancement in mobile networks and identifies its shortcomings. An improved Mel-frequency cepstral coefficient (MFCC) extraction method is then designed to extract 12 base features as the basis for enhancement. The relative average least squares loss replaces the traditional loss function to improve training efficiency, a hybrid penalty term strengthens the generator's ability to generate single-channel speech, and the discriminator is optimized with multi-layer convolution and additional fully connected layers. Together, these changes form a relative average generative adversarial network (RaGAN) with a hybrid penalty term for single-channel speech quality enhancement. Experiments show that the best enhancement in the mobile network is achieved when the discriminator uses 3×3 convolutional kernels. The method enhances single-channel speech quality in mobile networks and effectively reduces the noise in the original single-channel speech.
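
The abstract names a relative average least squares loss but does not give its form. The sketch below assumes it corresponds to the commonly used relativistic average least-squares GAN objective, in which each discriminator score is compared against the mean score of the opposite class. The function name `ralsgan_losses` and the margin of 1 are illustrative assumptions, and the hybrid penalty term is omitted because its definition is not stated here.

```python
from statistics import fmean


def ralsgan_losses(real_scores, fake_scores):
    """Return (discriminator_loss, generator_loss) from raw discriminator scores.

    Assumed form (not confirmed by the abstract): the relativistic average
    least-squares objective, where each score is measured relative to the
    mean score of the opposite class.
    """
    mean_real = fmean(real_scores)
    mean_fake = fmean(fake_scores)

    # Discriminator: real scores should sit 1 above the average fake score,
    # and fake scores 1 below the average real score.
    d_loss = (
        fmean((r - mean_fake - 1.0) ** 2 for r in real_scores)
        + fmean((f - mean_real + 1.0) ** 2 for f in fake_scores)
    )

    # Generator: the same terms with the roles of real and fake swapped.
    g_loss = (
        fmean((f - mean_real - 1.0) ** 2 for f in fake_scores)
        + fmean((r - mean_fake + 1.0) ** 2 for r in real_scores)
    )
    return d_loss, g_loss


if __name__ == "__main__":
    # Hypothetical scores for clean (real) and enhanced (fake) speech frames.
    d, g = ralsgan_losses([0.5, 0.5], [-0.5, -0.5])
    print(d, g)
```

Because the generator loss includes the real-sample term, the generator receives gradient from both clean and enhanced speech, which is the usual motivation given for relativistic losses.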

A Loss with Mixed Penalty for Speech Enhancement Generative Adversarial Network

On Loss Functions and Recurrency Training for GAN-based Speech Enhancement Systems

Towards Generalized Speech Enhancement with Generative Adversarial Networks

Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement

SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks

Single-Channel Speech Quality Enhancement in Mobile Networks Based on Generative Adversarial Networks

Coarse-to-fine Optimization for Speech Enhancement

Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework

Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition

A Speech Enhancement Method Based on Dual-Path Phase-Aware GAN Networks

MCGAN: Enhancing GAN Training with Regression-Based Generator Loss

Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification

Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN

Incremental Focal Loss GANs

Low-latency Speech Enhancement via Speech Token Generation

Enhancing Gappy Speech Audio Signals with Generative Adversarial Networks

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

SDGAN: Improve Speech Enhancement Quality by Information Filter

Study of GANs for Noisy Speech Simulation from Clean Speech