Abstract:Text classification is one of the fundamental tasks in natural language processing, which requires an agent to determine the most appropriate category for input sentences. Recently, deep neural networks have achieved impressive performance in this area, especially pretrained language models (PLMs). Usually, these methods concentrate on input sentences and corresponding semantic embedding generation. However, for another essential component: labels, most existing works either treat them as meaningless one-hot vectors or use vanilla embedding methods to learn label representations along with model training, underestimating the semantic information and guidance that these labels reveal. To alleviate this problem and better exploit label information, in this article, we employ self-supervised learning (SSL) in model learning process and design a novel self-supervised relation of relation (R <sup>2</sup> ) classification task for label utilization from a one-hot manner perspective. Then, we propose a novel () for text classification, in which text classification and R <sup>2</sup> classification are treated as optimization targets. Meanwhile, triplet loss is employed to enhance the analysis of differences and connections among labels. Moreover, considering that one-hot usage is still short of exploiting label information, we incorporate external knowledge from WordNet to obtain multiaspect descriptions for label semantic learning and extend to a novel () from a label embedding perspective. One step further, since these fine-grained descriptions may introduce unexpected noise, we develop a mutual interaction module to select appropriate parts from input sentences and labels simultaneously based on contrastive learning (CL) for noise mitigation. Extensive experiments on different text classification tasks reveal that can effectively improve the classification performance and can make better use of label information and further improve the performance. As a byproduct, we have released the codes to facilitate other research.

GUDN: A novel guide network with label reinforcement strategy for extreme multi-label text classification

MLGN:A Multi-Label Guided Network for Improving Text Classification

GNN-SL: Sequence Labeling Based on Nearest Examples Via GNN

Deep Learning for Extreme Multi-label Text Classification

GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification

On Data Augmentation for Extreme Multi-label Classification

HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification

AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification

Gated Neural Network with Regularized Loss for Multi-label Text Classification.

XRR: Extreme Multi-label Text Classification with Candidate Retrieving and Deep Ranking

Multilabel Text Classification Using Multilayer DGAT

Multi-label text classification based on semantic-sensitive graph convolutional network

Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification

Text-Guided Mixup Towards Long-Tailed Image Categorization

Label-aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification

Research of multi-label text classification based on label attention and correlation networks

A Dual-Branch Learning Model with Gradient-Balanced Loss for Long-Tailed Multi-Label Text Classification

Multi-label Text Classification Model Based on Multi-level Constraint Augmentation and Label Association Attention

Multi-directional guidance network for fine-grained visual classification

Meta-LMTC - Meta-Learning for Large-Scale Multi-Label Text Classification.

Description-Enhanced Label Embedding Contrastive Learning for Text Classification