Abstract:Robust loss minimization is an important strategy for handling robust learning issue on noisy labels. Current approaches for designing robust losses involve the introduction of noise-robust factors, i.e., hyperparameters, to control the trade-off between noise robustness and learnability. However, finding suitable hyperparameters for different datasets with noisy labels is a challenging and time-consuming task. Moreover, existing robust loss methods usually assume that all training samples share common hyperparameters, which are independent of instances. This limits the ability of these methods to distinguish the individual noise properties of different samples and overlooks the varying contributions of diverse training samples in helping models understand underlying patterns. To address above issues, we propose to assemble robust loss with instance-dependent hyperparameters to improve their noise tolerance with theoretical guarantee. To achieve setting such instance-dependent hyperparameters for robust loss, we propose a meta-learning method which is capable of adaptively learning a hyperparameter prediction function, called noise-aware-robust-loss-adjuster (NARL-Adjuster). Through mutual amelioration between hyperparameter prediction function and classifier parameters in our method, both of them can be simultaneously finely ameliorated and coordinated to attain solutions with good generalization capability. Four SOTA robust loss functions are attempted to be integrated with our algorithm, and comprehensive experiments substantiate the general availability and effectiveness of the proposed method in both its noise tolerance and performance. Meanwhile, the explicit parameterized structure makes the meta-learned prediction function ready to be transferrable and plug-and-play to unseen datasets with noisy labels. Specifically, we transfer our meta-learned NARL-Adjuster to unseen tasks, including several real noisy datasets, and achieve better performance compared with conventional hyperparameter tuning strategy, even with carefully tuned hyperparameters.

Adaptive Dropout Rates for Learning with Corrupted Features

Noise is the Fatal Poison: A Noise-aware Network for Noisy Dataset Classification

A Label Noise Robust Stacked Auto-Encoder Algorithm for Inaccurate Supervised Classification Problems

Removing the Feature Correlation Effect of Multiplicative Noise

Feature Noise Induces Loss Discrepancy Across Groups

Dropout Training for Support Vector Machines

Noise Modeling with Associative Corruption Rules

Learning with Feature-Dependent Label Noise: A Progressive Approach

A Robust Classifier for Noise-corruption Learning

Beyond Dropout: Feature Map Distortion to Regularize Deep Neural Networks

USDNL: Uncertainty-Based Single Dropout in Noisy Label Learning.

Compressing Features for Learning With Noisy Labels

Improve Noise Tolerance of Robust Loss via Noise-Awareness

DAT: Training Deep Networks Robust to Label-Noise by Matching the Feature Distributions

A robust adaptive linear regression method for severe noise

The Risk of Federated Learning to Skew Fine-Tuning Features and Underperform Out-of-Distribution Robustness

Adaptive Robust Learning using Latent Bernoulli Variables

Dropout Training for SVMs with Data Augmentation

Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

Balanced self-paced learning with feature corruption

Generalized Correntropy Based Deep Learning in Presence of Non-Gaussian Noises.