Abstract:Despite substantial progress in the field of deep learning, overfitting persists as a critical challenge, and data augmentation has emerged as a particularly promising approach due to its capacity to enhance model generalization in various computer vision tasks. While various strategies have been proposed, Mixed Sample Data Augmentation (MSDA) has shown great potential for enhancing model performance and generalization. We introduce a novel mixup method called MiAMix, which stands for Multi-stage Augmented Mixup. MiAMix integrates image augmentation into the mixup framework, utilizes multiple diversified mixing methods concurrently, and improves the mixing method by randomly selecting mixing mask augmentation methods. Recent methods utilize saliency information and the MiAMix is designed for computational efficiency as well, reducing additional overhead and offering easy integration into existing training pipelines. We comprehensively evaluate MiaMix using four image benchmarks and pitting it against current state-of-the-art mixed sample data augmentation techniques to demonstrate that MIAMix improves performance without heavy computational overhead.

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper aims to address the issue of overfitting in the field of deep learning. Despite significant advancements in computer vision tasks such as image classification, image segmentation, and object detection, overfitting remains a key challenge. Data augmentation has emerged as a particularly promising method for improving model generalization. However, existing Mixed Sample Data Augmentation (MSDA) methods, while showing potential for enhancing model performance and generalization, still face some limitations in practical applications, such as high computational costs and difficulty in seamless integration into existing training pipelines. To this end, the authors propose a new mixed method called MiAMix (Multi-stage Augmented Mixup), which aims to improve the performance of image classification tasks by enhancing mixed samples through multiple stages. The MiAMix method not only integrates image augmentation techniques but also introduces a variety of diverse mixing methods and improves the mixing method by randomly selecting mixing mask augmentation methods. Additionally, MiAMix is designed for computational efficiency, reducing extra overhead, and is easy to integrate into existing training pipelines. ### Main Contributions 1. **Redesign of GMix**: The authors first redesigned the GMix method and proposed an enhanced version called AGMix. AGMix leverages the flexibility of Gaussian kernels to generate more diverse mixed outputs. 2. **Novel Mixing Ratio Sampling Method**: A new mixing ratio sampling method was designed, suitable for multiple mixing masks. 3. **Multi-stage Mixing Framework**: A multi-stage MSDA method was defined, including stages such as random sample pairing, mixing methods and ratio sampling, mixing mask generation and augmentation, and the final mixed sample output stage. These stages are integrated into a comprehensive framework called MiAMix, and a search space containing multiple hyperparameters was established. ### Experimental Results The authors validated the performance of MiAMix through a series of rigorous evaluations on the CIFAR-10, CIFAR-100, and Tiny-ImageNet datasets. Experimental results show that MiAMix consistently outperforms existing mixed sample data augmentation methods in these benchmarks, setting new performance standards. Additionally, the authors evaluated the robustness of the model in the presence of natural noise, and the experimental results indicate that MiAMix significantly improves the model's robustness. ### Abstract Despite substantial progress in the field of deep learning, overfitting remains a key challenge. Data augmentation has emerged as a particularly promising method for improving model generalization. Although various strategies have been proposed, Mixed Sample Data Augmentation (MSDA) methods have shown great potential in enhancing model performance and generalization. This paper introduces a new mixup method called MiAMix, which is a multi-stage augmented mixup. MiAMix integrates image augmentation techniques into the mixup framework, uses a variety of diverse mixing methods, and improves the mixing method by randomly selecting mixing mask augmentation methods. MiAMix is also designed for computational efficiency, reducing extra overhead, and is easy to integrate into existing training pipelines. We comprehensively evaluate MiAMix through four image benchmarks and comparisons with current state-of-the-art mixed sample data augmentation techniques, demonstrating that MiAMix improves performance without adding significant computational overhead.

MiAMix: Enhancing Image Classification through a Multi-stage Augmented Mixed Sample Data Augmentation Method

TransformMix: Learning Transformation and Mixing Strategies from Data

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Unveiling the Power of Mixup for Stronger Classiﬁers

A New Data Augmentation Method Based on Mixup and Dempster-Shafer Theory

ResizeMix: Mixing Data with Preserved Object Information and True Labels

WeMix: How to Better Utilize Data Augmentation

Data Augmentation Using Mixup and Random Erasing

LMix:Regularization Strategy for Convolutional Neural Networks

RandoMix: a mixed sample data augmentation method with multiple mixed modes

Adversarial AutoMixup

Research Based on Mixed Sample Data Augmentation Method

LGCOAMix: Local and Global Context-and-Object-Part-Aware Superpixel-Based Data Augmentation for Deep Visual Recognition

ASMix: an Attention-based Smooth Data Augmentation Approach.

A Survey on Mixup Augmentations and Beyond

Global Mixup: Eliminating Ambiguity with Clustering.

Harnessing Hard Mixed Samples with Decoupled Regularizer

Decoupled Mixup for Data-efficient Learning

SpliceMix: A Cross-scale and Semantic Blending Augmentation Strategy for Multi-label Image Classification.

Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification

DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models