Abstract: Source-Free Domain Adaptive Object Detection (SFOD) aims to transfer a detector pre-trained on a source domain to new, unlabelled target domains. Current SFOD methods typically follow the Mean Teacher framework, where weak-to-strong augmentation provides diverse and sharp contrast for self-supervised learning. However, this augmentation strategy suffers from an inherent problem called crucial semantics loss: because it applies random, strong perturbations, strong augmentation is prone to losing typical visual components, which hinders cross-domain feature extraction. To address this so-far overlooked limitation, this paper introduces a novel Weak-to-Strong Contrastive Learning (WSCoL) approach. The core idea is to distill the semantically lossless knowledge in the weak features (from the weak/teacher branch) to guide representation learning on the strong features (from the strong/student branch). To achieve this, we project the original features into a shared space using a mapping network, thereby reducing the bias between the weak and strong features. Meanwhile, weak-feature-guided contrastive learning is performed alternately in a weak-to-strong manner. Specifically, we first conduct adaptation-aware, prototype-guided clustering on the weak features to generate pseudo labels for the corresponding strong features, which are matched through proposals. Subsequently, we identify positive and negative samples based on the pseudo labels and perform cross-category contrastive learning on the strong features, where an uncertainty estimator encourages adaptive background contrast. Extensive experiments demonstrate that WSCoL yields new state-of-the-art performance, offering a built-in mechanism for mitigating crucial semantics loss in the traditional Mean Teacher framework. The code and data will be released soon.
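
The weak-to-strong pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the linear `project` function stands in for the mapping network, nearest-prototype assignment stands in for the adaptation-aware prototype-guided clustering, and a standard supervised contrastive loss stands in for the cross-category contrastive objective. The uncertainty estimator for background contrast is omitted, and all function names, shapes, and the temperature value are assumptions made for illustration.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def project(features, weight):
    """Hypothetical stand-in for the mapping network: one shared linear map
    applied to both weak and strong proposal features, then normalised."""
    return l2_normalize(features @ weight)

def pseudo_labels_from_prototypes(weak_z, prototypes):
    """Assign each weak feature to its most similar class prototype.
    (Assumed simplification of prototype-guided clustering.)"""
    sims = weak_z @ l2_normalize(prototypes).T
    return sims.argmax(axis=1)

def supervised_contrastive_loss(strong_z, labels, temperature=0.1):
    """Supervised contrastive loss on strong features: samples sharing a
    pseudo label are positives, all other samples are negatives."""
    n = strong_z.shape[0]
    if n < 2:
        return 0.0
    not_self = ~np.eye(n, dtype=bool)
    logits = np.where(not_self, strong_z @ strong_z.T / temperature, -np.inf)
    # Numerically stable log-softmax over every other sample in the batch.
    row_max = logits.max(axis=1, keepdims=True)
    log_prob = logits - row_max - np.log(
        np.exp(logits - row_max).sum(axis=1, keepdims=True))
    positives = (labels[:, None] == labels[None, :]) & not_self
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0  # anchors with at least one positive
    if not valid.any():
        return 0.0
    pos_log_prob = np.where(positives, log_prob, 0.0).sum(axis=1)
    return float(-(pos_log_prob[valid] / pos_count[valid]).mean())

# Usage: weak and strong features are matched row by row (same proposals).
rng = np.random.default_rng(0)
weak = rng.normal(size=(8, 16))                  # teacher-branch features
strong = weak + 0.1 * rng.normal(size=(8, 16))   # student-branch features
W = rng.normal(size=(16, 4))                     # shared mapping weights
prototypes = rng.normal(size=(3, 4))             # one prototype per class
labels = pseudo_labels_from_prototypes(project(weak, W), prototypes)
loss = supervised_contrastive_loss(project(strong, W), labels)
```

Because the pseudo labels come only from the weak features, the strong features never define their own targets; in this sketch that is the sense in which the weak branch guides learning on the strong branch.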

Rethinking Weak-to-Strong Augmentation in Source-Free Domain Adaptive Object Detection