Abstract:Text classification is one of the classic tasks in the field of natural language processing. The goal is to identify the category to which the text belongs. Text categorization is widely used in email detection, sentiment analysis, topic marking and other fields. However, good text representation is the key to improve the performance of natural language processing tasks such as text classification. Traditional text representation adopts bag-of-words model or vector space model, which not only loses the context information of the text, but also faces the problems of high latitude and high sparsity. In recent years, with the increase of data and the improvement of computing performance, the use of deep learning technology to represent and classify texts has attracted great attention. Convolutional neural network, recurrent neural network and recurrent neural network with attention mechanism are used to represent the text, and then to classify the text and other natural language processing tasks, all of which have better performance than the traditional methods. In this paper, we design two sentence-level text representation and classification models based on the deep network. The details are as follows: (1) Text representation and classification model based on bidirectional cyclic and convolutional neural networks-BRCNN. Brcnn's input is the word vector corresponding to each word in the sentence; After using cyclic neural network to extract word order information in sentences, convolution neural network is used to extract higher-level features of sentences. After convolution, the maximum pool operation is used to obtain sentence vectors. At last, softmax classifier is used for classification. Cyclic neural network can capture the word order information in sentences, while convolutional neural network can extract useful features. Experiments on eight text classification tasks show that BRCNN model can get better text feature representation, and the classification accuracy rate is equal to or higher than that of the prior art.. (2) A text representation and classification model based on attention mechanism and convolutional neural network-ACNN. ACNN model uses the recurrent neural network with attention mechanism to obtain the context vector; Then convolution neural network is used to extract more advanced feature information. The maximum pool operation is adopted to obtain a sentence vector; At last, the softmax classifier is used to classify the text. Experiments on eight text classification benchmark data sets show that ACNN improves the stability of model convergence, and can converge to an optimal or local optimal solution better than BRCNN.

A Text Representation Model Based on Convolutional Neural Network and Variational Auto Encoder.

An Intelligent CNN-VAE Text Representation Technology Based on Text Semantics for Comprehensive Big Data

Incorporating Knowledge into Neural Network for Text Representation.

Database Systems for Advanced Applications

Combining Vector Space Features and Convolution Neural Network for Text Sentiment Analysis.

Application of Dual-Channel Convolutional Neural Network Algorithm in Semantic Feature Analysis of English Text Big Data

Research On Text Classification Based On Deep Neural Network

Text Sentiment Analysis Based on Convolutional Neural Network and Bidirectional LSTM Model.

Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling.

Adaptive Region Embedding for Text Classification.

Variational autoencoder-based neural network model compression

Tightly-coupled Convolutional Neural Network with Spatial-Temporal Memory for Text Classification.

A Study of Text Vectorization Method Combining Topic Model and Transfer Learning

Study on text representation method based on deep learning and topic information

DCNN-BiGRU Text Classification Model Based on BERT Embedding

A Multi-Modal High Spatial Resolution Aerial Imagery Scene Classification Model with Visual Enhancement

Hybrid Graph Neural Network Model Design and Modeling Reasoning for Text Feature Extraction and Recognition

Text Classification Based on Neural Network Fusion

A neural topic model with word vectors and entity vectors for short texts

Convolutional Network Embedding of Text-Enhanced Representation for Knowledge Graph Completion

A text classification network model combining machine learning and deep learning