Abstract:Recently, many benchmark datasets have been found to contain noisy labels caused by unavoidable human mistakes. Many researchers propose new noise-aware loss functions to achieve robust classification performance. However, we discover that existing noise-aware loss functions cannot fully heal the damage caused by the noise. On the other hand, some methods filter out low confidence samples and train new models, whereas the filtered samples contain both noisy and hard samples that are critical for the robustness of models. Based on the above two discoveries, we devised the Noise-aware Network (NA-Net) for robust training with noisy labels. Each layer of NA-Net contains three groups of convolution kernels responsible for mix samples, clean samples, and noisy samples, termed as mix-kernels, clean-kernels, and noise-kernels, respectively. Mix-kernels are used for finding the clean samples with a newly devised noise-immune (NI) loss function; clean-kernels are targeted at learning better features without being misguided by noise; noise-kernels are trained by the remaining samples to rectify wrong labels for the next iteration. Meanwhile, for increasing the classification performance of mix-kernels, the extracted feature maps of clean-kernels without being poisoned are combined as the input of mix-kernels of the next layer. Also, the knowledge distillation strategy is adopted to distill the knowledge from clean-kernels to the noise-kernels. Extensive experiments demonstrate that the mutual promotion of three groups of kernels in NA-Net achieves state-of-the-art performance on both artificial noisy datasets and real-world datasets.

Noisy training for deep neural networks in speech recognition

DENOISPEECH: DENOISING TEXT TO SPEECH WITH FRAME-LEVEL NOISE MODELING

Noise is the Fatal Poison: A Noise-aware Network for Noisy Dataset Classification

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech

Dynamic noise aware training for speech enhancement based on deep neural networks.

A regression approach to speech enhancement based on deep neural networks

Deep Neural Network Based Noised Asian Speech Enhancement and Its Implementation on a Hearing Aid App.

SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement

Transfer Learning for Acoustic Modeling of Noise Robust Speech Recognition

A Novel Training Strategy Using Dynamic Data Generation for Deep Neural Network Based Speech Enhancement.

Building DNN acoustic models for large vocabulary speech recognition

Training Recurrent Neural Networks against Noisy Computations during Inference

Denoising Noisy Neural Networks: A Bayesian Approach with Compensation.

Noisy Training Improves E2E ASR for the Edge

Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training

Control System and Speech Recognition of Exhibition Hall Digital Media Based on Computer Technology

Speech Separation Based on Signal-Noise-dependent Deep Neural Networks for Robust Speech Recognition

On Generating Mixing Noise Signals With Basis Functions For Simulating Noisy Speech And Learning Dnn-Based Speech Enhancement Models

An Experimental Study on Speech Enhancement Based on Deep Neural Networks

Deep learning restores speech intelligibility in multi-talker interference for cochlear implant users

Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement