Abstract:Data augmentation (DA) has been widely used to improve the generalization of deep neural networks. While existing DA methods have proven effective, they often rely on augmentation operations with random magnitudes to each sample. However, this approach can inadvertently introduce noise, induce distribution shifts, and increase the risk of overfitting. In this paper, we propose EntAugment, a tuning-free and adaptive DA framework. Unlike previous work, EntAugment dynamically assesses and adjusts the augmentation magnitudes for each sample during training, leveraging insights into both the inherent complexities of training samples and the evolving status of deep models. Specifically, in EntAugment, the magnitudes are determined by the information entropy derived from the probability distribution obtained by applying the softmax function to the model's output. In addition, to further enhance the efficacy of EntAugment, we introduce a novel entropy regularization term, EntLoss, which complements the EntAugment approach. Theoretical analysis further demonstrates that EntLoss, compared to traditional cross-entropy loss, achieves closer alignment between the model distributions and underlying dataset distributions. Moreover, EntAugment and EntLoss can be utilized separately or jointly. We conduct extensive experiments across multiple image classification tasks and network architectures with thorough comparisons of existing DA methods. Importantly, the proposed methods outperform others without introducing any auxiliary models or noticeable extra computational costs, highlighting both effectiveness and efficiency. Code is available at <a class="link-external link-https" href="https://github.com/Jackbrocp/EntAugment" rel="external noopener nofollow">this https URL</a>.

Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification

Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation from Scratch

Decoupled Data Augmentation for Improving Image Classification

DIAGen: Diverse Image Augmentation with Generative Models

Effective Data Augmentation With Diffusion Models

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

DreamDA: Generative Data Augmentation with Diffusion Models

DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers

DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification

A Simple Background Augmentation Method for Object Detection with Diffusion Model

Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Data Augmentation Based on DiscrimDiff for Histopathology Image Classification

Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions

Image retrieval outperforms diffusion models on data augmentation

DDA: A Dynamic Difficulty-aware Data Augmenter for Image Super-resolution

DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models

Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection