Abstract:Imbalanced learning is a traditional yet hot research subarea in machine learning. There are a huge number of imbalanced learning methods proposed in previous literature. This study focuses on one of the most popular imbalanced learning strategies, namely, sample reweighting. The key issue is how to calculate the weights of samples in training. While most studies have relied on intuitive theoretical or heuristic inspirations, few studies have attempted to establish a comprehensive theoretical path for weight calculation. A recent study utilizes the effective number theory for random covering to construct a theoretical weighting framework. In this study, we conduct a deep analysis to theoretically reveal the defects in the existing effective number-based weighting theory. An enhanced effective number theory is established in which data scatter and covering offset among different categories are involved. Subsequently, a new weight calculation manner is proposed based on our new theory, yielding a new loss, namely, NENum loss. In this loss, weights are sample-wise instead of category-wise used in the existing effective number-based weighting. Furthermore, another novel loss that combines weighting and logit perturbation is designed inspired the limitations of the NENum loss. Meta learning is employed to optimize the concrete calculation based on sample-wise training dynamics. We conduct extensive experiments on benchmark imbalanced and standard data corpora. Results validate the reasonableness of our enhanced theory and the effectiveness of the proposed methodology.

Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting.

A Novel Re-weighted CTC Loss for Data Imbalance in Speech Keyword Spotting

Keyword Spotting Based on Hypothesis Boundary Realignment and State-Level Confidence Weighting

Weighted Cluster-Range Loss and Criticality-Enhancement Loss for Speaker Recognition

On Front-end Gain Invariant Modeling for Wake Word Spotting

A Two-Step Keyword Spotting Method Based on Context-Dependent a Posteriori Probability

Sample Weighting: an Inherent Approach for Outlier Suppressing Discriminant Analysis

Revisiting the Loss Weight Adjustment in Object Detection

Revisiting the Effective Number Theory for Imbalanced Learning

GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting

Inverse Weight-Balancing for Deep Long-Tailed Learning

Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting

Avoid Overfitting User Specific Information in Federated Keyword Spotting

Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting

Audio-visual Keyword Spotting for Mandarin Based on Discriminative Local Spatial-Temporal Descriptors.

Multilingual Query-by-Example Keyword Spotting with Metric Learning and Phoneme-to-Embedding Mapping

AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy

DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

Dark Experience for Incremental Keyword Spotting

Audio-visual Keyword Spotting Based on Adaptive Decision Fusion under Noisy Conditions for Human-Robot Interaction.