ADAM: An Attentional Data Augmentation Method for Extreme Multi-label Text Classification

Jiaxin Zhang,Jie Liu,Shaowei Chen,Shaoxin Lin,Bingquan Wang,Shanpeng Wang
DOI: https://doi.org/10.1007/978-3-031-05933-9_11
2022-01-01
Abstract:Extreme Multi-label text Classification (XMC) is a fundamental text mining task, which aims to assign multiple labels related to the given text from a large-scale label set. Various models and many data augmentation methods are proposed to improve classification performance. However, the classification performance is limited due to the long tail distribution of labels, which is an essential characteristic of XMC. To address this problem, we propose a novel data augmentation method named Attentional Data Augmentation Method (ADAM) for long tail labels. Specifically, we split each sentence into several segments of equal length and use an attention-based neural network to explore the core segments of long tail labels. The unimportant segments of each instance from the dataset are considered to be replaced by those segments related to the long tail labels. Extensive experiments show that ADAM has an improvement based on the XMC method, especially on the prediction of long tail labels.
What problem does this paper attempt to address?