Abstract:Text2text question classification (TQC), as a particular application case of question classification (QC), is of great practical value. Traditional QC methods usually label categories of questions using one or mul-tiple keywords provided by users. In contrast, in TQC, each question in natural language is automatically categorized into pre-designed standard question classes, which are coded in the form of short text. Because of this unique characteristic, TQC relies on a specifically designed framework and should be trained and validated based on customized experimental datasets. Previous TQC-related work mainly uti-lized textual similarity-matching methods. However, no effective pairwise learning paradigm has been proposed in TQC to model correlations between input text and classes; and the influence of distance met-rics and loss function in TQC has not been investigated. In this work, we propose a novel and comprehen-sive strategy, Augmented Dynamic Multi-layer Contrastive (ADMC), to resolve the challenge of TQC. Our framework consists of (1) an optional data augmentation module, (2) one stage for dynamic negative sampling, and (3) one stage for precise matching. The comprehensive TQC framework with ADMC strat-egy in this work resolves data imbalance and explores distance metrics learning via multiple augmenta-tion options and dynamic negative sampling based on multi-layer contrastive learning. To compensate for the shortage of public datasets for this task, we collected two real-world datasets and adaptively expanded three existing public datasets, which will be available after data masking. The results show that our ADMC outperformed other baseline methods investigated in this paper. The codes are available at https://github.com/WJULYW/ADMC. (c) 2023 Elsevier B.V. All rights reserved.

Automatic Counterfactual Augmentation for Robust Text Classification Based on Word-Group Search

Counterfactual Contrastive Learning for Robust Text Classification

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals

What Have Been Learned & What Should Be Learned? An Empirical Study of How to Selectively Augment Text for Classification

Counterfactual Fairness in Text Classification through Robustness

Multi-round Counterfactual Generation: Interpreting and Improving Models of Text Classification.

Text Counterfactuals via Latent Optimization and Shapley-Guided Search

Selective Text Augmentation with Word Roles for Low-Resource Text Classification

Data Augmentations for Improved (Large) Language Model Generalization

Relation-based Counterfactual Data Augmentation and Contrastive Learning for Robustifying Natural Language Inference Models

Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning

Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis.

TextCheater: A Query-Efficient Textual Adversarial Attack in the Hard-Label Setting

WordBlitz: an Efficient Hard-Label Textual Adversarial Attack Method Jointly Leveraging Adversarial Transferability and Word Importance

A Comparative Analysis of Counterfactual Explanation Methods for Text Classifiers

Synthesizing Counterfactual Samples for Effective Image-Text Matching

Implicit Counterfactual Data Augmentation for Robust Learning

STA: Self-controlled Text Augmentation for Improving Text Classifications

Preciser Comparison: Augmented Multi-Layer Dynamic Contrastive Strategy for Text2text Question Classification.

A Keyword-Enhanced Approach to Handle Class Imbalance in Clinical Text Classification