Multi-label Text Classification Model Based on Multi-level Constraint Augmentation and Label Association Attention
Xiao Wei,Jianbao Huang,Rui Zhao,Hang Yu,Zheng Xu
DOI: https://doi.org/10.1145/3586008
IF: 1.471
2023-05-01
ACM Transactions on Asian and Low-Resource Language Information Processing
Abstract:In the multi-label text classification task, a text usually corresponds to multiple label categories, and the labels have correlation and hierarchical structure. However, when the label hierarchy is unknown, the number of various labels is not balanced, which makes it difficult for the model to classify low-frequency labels. At the same time, due to the existence of similar labels, the model will be difficult to distinguish similar labels. In this paper, we propose a multi-label text classification model based on multi-level constraint augmentation and label association attention. Compared with traditional methods, our method has two contributions: (1) In order to alleviate the problem of unbalanced number of different label categories and ensure the rationality of sample generation, we propose a data augmentation method based on multi-level constraints. In the process of sample generation, this method uses historical generation information, sample original text information and sample topic to constrain the generated text. (2) In order to make the model recognize the associated labels accurately, we propose an interaction mechanism based on label association attention and filter gate. This method combines text information and label weight information. At the same time, our classification model considers the important weights of text sentences and effectively utilizes the co-occurrence relationship between labels. Experimental results on three benchmark datasets show that our model outperforms state-of-the-art methods on all main evaluation metrics, especially on low-frequency label prediction with sparse samples.
computer science, artificial intelligence