Enhancing Deep Learning-Based Multi-label Text Classification with Capsule Network

Siyi Yan
DOI: https://doi.org/10.1088/1742-6596/1621/1/012037
2020-08-01
Journal of Physics: Conference Series
Abstract:Abstract Given a piece of text, multi-label text classification (MLTC) is designed to mark the most relevant one label or multiple labels for the text. Most of the existing MLTC models use convolutional neural network (CNN) as feature extractor, but CNN will lose information when dealing with MLTC task. In this paper, we explore the CNN combined with capsule network for MLTC. We use capsule network instead of pool layer in CNN to extract information related to classification results in high-dimensional features. We also explore the way of combining recurrent neural network (RNN) and CNN to model the characteristics of time and space for capsule network to complete classification. In two open MLTC datasets, our model achieves the better results as the baseline system, which shows the effectiveness of the combination of capsule network and CNN for MLTC.
What problem does this paper attempt to address?