Chinese Short Text Multi-Classification Based on Word and Part-of-Speech Tagging Embedding

Juan Tian,Dingju Zhu,Hui Long
DOI: https://doi.org/10.1145/3302425.3302430
2018-12-21
Abstract:In this paper, a convolutional neural network model with part-of-speech tagging and word double embedding is proposed to deal with text multi-classification problem. The chunk max pooling is added in the sampling layer for down sampling to enhance the ability of feature extraction. And in the text preprocessing, the word segmentation knowledge base is expanded according to the content of the data set to improve the accuracy of the text preprocessing model. In order to verify the accuracy of the model, 8000 enterprise text profiles were used to classify the business categories. The experimental results show that compared with the traditional machine learning model and the standard convolutional neural network model, the accuracy of the proposed model is improved.
What problem does this paper attempt to address?