Improving Generalization of Convolutional Neural Network Through Contrastive Augmentation
Xiaosong Li,Yanxia Wu,Chuheng Tang,Yan Fu,Lidan Zhang
DOI: https://doi.org/10.1016/j.knosys.2023.110543
IF: 8.139
2023-01-01
Knowledge-Based Systems
Abstract:Data augmentation is widely used to improve the generalization ability of convolutional neural networks in the image domain. The conventional augmentation schemes, e.g., single augmentation (including RandAug, Mixup, and CutMix) or batch augmentation, only optimize the network from pairs of augmented images and corresponding labels, leading to a loss of discriminative representation learning. To this end, we propose contrastive augmentation (CA), to learn discriminative representation by capturing the contrastive semantics among augmented samples. Specifically, the proposed CA method explicitly regularizes the similarity of augmented sample pairs to enable the model learn contrastive semantics. Equipped with CA at the training stage, the model simultaneously learns classification representation and contrastive semantics, which makes representation discriminative and does not incur additional inference computational costs. On conventional and fine-grained classification tasks, the experimental results show that the proposed method can effectively improve the generalization capability of convolutional neural networks, including ResNet, EfficientNet, MobileNet, ShuffleNet, and VAN. The compatibility experiment shows that our contrastive augmentation can be combined with other augmentation techniques, e.g., Mixup and CutMix, to further improve the model’s performance.