Label-Embedding Bi-directional Attentive Model for Multi-label Text Classification

Liu Naiyin, Wang Qianlong, Ren Jiangtao
DOI: https://doi.org/10.1007/s11063-020-10411-8
IF: 2.565
2021-01-01
Neural Processing Letters
Abstract: Multi-label text classification is a critical task in the field of natural language processing. As the latest language representation model, BERT obtains new state-of-the-art results on classification tasks. Nevertheless, BERT’s text classification framework does not make full use of token-level text representations or label embeddings, since it uses only the final hidden state corresponding to the [CLS] token as the sequence-level text representation for classification. We assume that finer-grained token-level text representations and label embeddings contribute to classification. Consequently, in this paper we propose a Label-Embedding Bi-directional Attentive model to improve the performance of BERT’s text classification framework. In particular, we extend BERT’s text classification framework with label embedding and bi-directional attention. Experimental results on five datasets indicate that our model achieves notable improvements over both baselines and state-of-the-art models.
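The abstract does not spell out how the bi-directional attention is computed, so the following is only a minimal sketch of one plausible reading, not the authors' formulation. It assumes token vectors (standing in for BERT's token-level hidden states) and label embeddings share one dimension, uses plain dot-product similarity, and scores each label with an independent sigmoid, as is usual for multi-label outputs. All function names here are hypothetical.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def weighted_sum(weights, vectors):
    # Convex combination of equally sized vectors.
    dim = len(vectors[0])
    return [sum(w * vec[j] for w, vec in zip(weights, vectors)) for j in range(dim)]


def bidirectional_attention(tokens, labels):
    # sim[i][j] is the dot-product similarity of token i and label j (assumed form).
    sim = [[dot(t, lab) for lab in labels] for t in tokens]
    # Token-to-label direction: each token attends over all labels.
    token_to_label = [weighted_sum(softmax(row), labels) for row in sim]
    # Label-to-token direction: each label attends over all tokens.
    label_to_token = [
        weighted_sum(softmax([sim[i][j] for i in range(len(tokens))]), tokens)
        for j in range(len(labels))
    ]
    return token_to_label, label_to_token


def label_probabilities(tokens, labels):
    # Assumption: score each label by matching its attended text vector
    # against its own embedding, then apply an independent sigmoid.
    _, label_to_token = bidirectional_attention(tokens, labels)
    return [
        1.0 / (1.0 + math.exp(-dot(ctx, lab)))
        for ctx, lab in zip(label_to_token, labels)
    ]
```

Because each label gets its own sigmoid, the probabilities need not sum to one, which is what lets several labels be predicted for the same text.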