SynerMix: Synergistic Mixup Solution for Enhanced Intra-Class Cohesion and Inter-Class Separability in Image Classification

Ye Xu, Ya Gao, Xiaorong Qiu, Yang Chen, Ying Ji
2024-03-24
Abstract: MixUp and its variants (e.g., Manifold MixUp) have two shortcomings in image classification tasks: they neglect mixing within the same class (intra-class mixup), and their mixing operations are inadequate for enhancing intra-class cohesion. To address these issues, we propose a novel mixup method named SynerMix-Intra and, building upon this, introduce a synergistic mixup solution named SynerMix. SynerMix-Intra specifically targets intra-class mixup to bolster intra-class cohesion, a feature not addressed by current mixup methods. For each mini-batch, it leverages feature representations of unaugmented original images from each class to generate a synthesized feature representation through random linear interpolation. All synthesized representations are then fed into the classification and loss layers to calculate an average classification loss that significantly enhances intra-class cohesion. Furthermore, SynerMix combines SynerMix-Intra with an existing mixup approach (e.g., MixUp, Manifold MixUp), which primarily focuses on inter-class mixup and has the benefit of enhancing inter-class separability. In doing so, it integrates both inter- and intra-class mixup in a balanced way while concurrently improving intra-class cohesion and inter-class separability. Experimental results on six datasets show that SynerMix achieves 0.1% to 3.43% higher accuracy than the better of MixUp and SynerMix-Intra alone, with an average gain of 1.16%. It also surpasses the better of Manifold MixUp and SynerMix-Intra by 0.12% to 5.16%, with an average gain of 1.11%. Given that SynerMix is model-agnostic, it holds significant potential for application in other domains where mixup methods have shown promise, such as speech and text classification. Our code is publicly available at:
Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
The paper attempts to address two key limitations of existing mixup-based data augmentation techniques (MixUp and its variants) in image classification tasks:

1. **Ignoring Intra-Class Mixup**: Existing mixup methods (e.g., MixUp, Manifold MixUp, CutMix) often ignore intra-class mixup, which involves mixing samples within the same class. This oversight leaves the relationships between samples of the same class underused, which limits the improvement in image classification performance.
2. **Insufficient Intra-Class Cohesion**: While existing mixup methods excel at enhancing inter-class separability, their mixing operations are less effective at enhancing intra-class cohesion. As a result, intra-class feature representations are not compact enough, which affects the overall performance of the classification task.

To address these issues, the paper proposes two new methods:

- **SynerMix-Intra**: Targets intra-class mixup to enhance intra-class cohesion. For each class in each mini-batch, SynerMix-Intra generates a synthetic feature representation through random linear interpolation of that class's sample features and inputs it into the classification layer. The classification loss averaged over these synthetic representations promotes tight clustering of intra-class feature representations.
- **SynerMix**: Combines SynerMix-Intra with an existing inter-class mixup method (such as MixUp or Manifold MixUp), balancing intra-class and inter-class mixup while simultaneously improving intra-class cohesion and inter-class separability. In this way, SynerMix can better adapt to the commonalities within classes and the differences between classes in various image classification tasks, thereby enhancing the model's generalization ability and robustness.
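
The two methods described above can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation. The text only states that SynerMix-Intra uses "random linear interpolation", so the weight scheme here (uniform random draws normalized to sum to 1) is an assumption, as are the function names `intra_class_mix`, `intra_loss`, and `synermix_loss`, the `classifier` callable, and the blending hyperparameter `beta`. Features are lists of floats and labels are integer class indices.

```python
import math
import random
from collections import defaultdict


def intra_class_mix(features, labels, rng):
    """Synthesize one feature vector per class present in the mini-batch.

    Each synthesized vector is a random convex combination of the feature
    vectors sharing a label. ASSUMPTION: weights are uniform random draws
    normalized to sum to 1; the source does not specify the distribution.
    """
    by_class = defaultdict(list)
    for feat, label in zip(features, labels):
        by_class[label].append(feat)

    mixed = {}
    for label, feats in by_class.items():
        raw = [rng.random() + 1e-12 for _ in feats]  # strictly positive
        total = sum(raw)
        weights = [w / total for w in raw]
        dim = len(feats[0])
        mixed[label] = [
            sum(w * f[d] for w, f in zip(weights, feats)) for d in range(dim)
        ]
    return mixed


def cross_entropy(logits, target):
    """Numerically stable softmax cross-entropy for a single example."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


def intra_loss(features, labels, classifier, rng):
    """Average classification loss over the synthesized per-class features.

    `classifier` is a hypothetical callable mapping a feature vector to a
    list of logits, one per class.
    """
    mixed = intra_class_mix(features, labels, rng)
    losses = [cross_entropy(classifier(f), label) for label, f in mixed.items()]
    return sum(losses) / len(losses)


def synermix_loss(loss_intra, loss_inter, beta=0.5):
    """ASSUMPTION: blend the intra-class loss with the loss of an existing
    inter-class mixup method (e.g., MixUp) using a weight `beta` in [0, 1]."""
    return beta * loss_intra + (1.0 - beta) * loss_inter
```

In a training loop, `loss_inter` would come from whichever inter-class mixup method is used, and `beta` would control the balance between cohesion and separability. A real implementation would operate on batched tensors in a deep learning framework rather than Python lists.
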
Experimental results show that SynerMix outperforms existing MixUp methods on multiple public image datasets, validating its effectiveness in simultaneously improving intra-class cohesion and inter-class separability. Additionally, since SynerMix is model-agnostic, it has broad application potential in other domains (such as speech and text classification).