Abstract:Self-supervised learning approach like contrastive learning is attached great attention in natural language processing. It uses pairs of training data augmentations to build a classification task for an encoder with well representation ability. However, the construction of learning pairs over contrastive learning is much harder in NLP tasks. Previous works generate word-level changes to form pairs, but small transforms may cause notable changes on the meaning of sentences as the discrete and sparse nature of natural language. In this paper, adversarial training is performed to generate challenging and harder learning adversarial examples over the embedding space of NLP as learning pairs. Using contrastive learning improves the generalization ability of adversarial training because contrastive loss can uniform the sample distribution. And at the same time, adversarial training also enhances the robustness of contrastive learning. Two novel frameworks, supervised contrastive adversarial learning (SCAL) and unsupervised SCAL (USCAL), are proposed, which yields learning pairs by utilizing the adversarial training for contrastive learning. The label-based loss of supervised tasks is exploited to generate adversarial examples while unsupervised tasks bring contrastive loss. To validate the effectiveness of the proposed framework, we employ it to Transformer-based models for natural language understanding, sentence semantic textual similarity and adversarial learning tasks. Experimental results on GLUE benchmark tasks show that our fine-tuned supervised method outperforms BERT$_{base}$ over 1.75\%. We also evaluate our unsupervised method on semantic textual similarity (STS) tasks, and our method gets 77.29\% with BERT$_{base}$. The robustness of our approach conducts state-of-the-art results under multiple adversarial datasets on NLI tasks.

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

Adversarial Supervised Contrastive Learning

Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks

Simple Contrastive Representation Adversarial Learning for NLP Tasks

Introducing Adaptive Continuous Adversarial Training (ACAT) to Enhance ML Robustness

ValCAT: Variable-Length Contextualized Adversarial Transformations Using Encoder-Decoder Language Model

CAT:Collaborative Adversarial Training

Contrastive Adversarial Training for Unsupervised Domain Adaptation

Improving Gradient-based Adversarial Training for Text Classification by Contrastive Learning and Auto-Encoder.

CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation

Robust Pre-Training by Adversarial Contrastive Learning

CAT: Customized Adversarial Training for Improved Robustness

Contrastive learning with text augmentation for text classification

Rethinking the Effect of Data Augmentation in Adversarial Contrastive Learning

Towards Improving Adversarial Training of NLP Models

CATIL: Customized Adversarial Training based on Instance Loss

LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification

TextCheater: A Query-Efficient Textual Adversarial Attack in the Hard-Label Setting

Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings

Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Robustness through Cognitive Dissociation Mitigation in Contrastive Adversarial Training