Abstract:Self-supervised learning approach like contrastive learning is attached great attention in natural language processing. It uses pairs of training data augmentations to build a classification task for an encoder with well representation ability. However, the construction of learning pairs over contrastive learning is much harder in NLP tasks. Previous works generate word-level changes to form pairs, but small transforms may cause notable changes on the meaning of sentences as the discrete and sparse nature of natural language. In this paper, adversarial training is performed to generate challenging and harder learning adversarial examples over the embedding space of NLP as learning pairs. Using contrastive learning improves the generalization ability of adversarial training because contrastive loss can uniform the sample distribution. And at the same time, adversarial training also enhances the robustness of contrastive learning. Two novel frameworks, supervised contrastive adversarial learning (SCAL) and unsupervised SCAL (USCAL), are proposed, which yields learning pairs by utilizing the adversarial training for contrastive learning. The label-based loss of supervised tasks is exploited to generate adversarial examples while unsupervised tasks bring contrastive loss. To validate the effectiveness of the proposed framework, we employ it to Transformer-based models for natural language understanding, sentence semantic textual similarity and adversarial learning tasks. Experimental results on GLUE benchmark tasks show that our fine-tuned supervised method outperforms BERT$_{base}$ over 1.75\%. We also evaluate our unsupervised method on semantic textual similarity (STS) tasks, and our method gets 77.29\% with BERT$_{base}$. The robustness of our approach conducts state-of-the-art results under multiple adversarial datasets on NLI tasks.

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding

Simple Flow-Based Contrastive Learning for BERT Sentence Representations

Alleviating Over-smoothing for Unsupervised Sentence Representation.

Contrastive Learning Models for Sentence Representations

Simple Contrastive Representation Adversarial Learning for NLP Tasks

reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning

A Mutually Reinforced Framework for Pretrained Sentence Embeddings

CoT-BERT: Enhancing Unsupervised Sentence Representation through Chain-of-Thought

Boost Supervised Pretraining for Visual Transfer Learning: Implications of Self-Supervised Contrastive Representation Learning.

Cross-modal Contrastive Learning for Speech Translation

A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings

PromptBERT: Improving BERT Sentence Embeddings with Prompts

Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework

CLSESSP: Contrastive learning of sentence embedding with strong semantic prototypes

C2BERT - Cross-contrast BERT for Chinese Biomedical Sentence Representation.

RankCSE: Unsupervised Sentence Representations Learning Via Learning to Rank

Contrastive Learning in Distilled Models

Unsupervised Sentence Embedding Model Based on Contrastive Learning

CCDC: A Chinese-Centric Cross Domain Contrastive Learning Framework

Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss