Abstract:Neural network models have been widely used in the field of natural language processing (NLP). Recurrent neural networks (RNNs), which have the ability to process sequences of arbitrary length, are common methods for sequence modeling tasks. Long short-term memory (LSTM) is one kind of RNNs and has achieved remarkable performance in text classification. However, due to the high dimensionality and sparsity of text data, and to the complex semantics of the natural language, text classification presents difficult challenges. In order to solve the above problems, a novel and unified architecture which contains a bidirectional LSTM (BiLSTM), attention mechanism and the convolutional layer is proposed in this paper. The proposed architecture is called attention-based bidirectional long short-term memory with convolution layer (AC-BiLSTM). In AC-BiLSTM, the convolutional layer extracts the higher-level phrase representations from the word embedding vectors and BiLSTM is used to access both the preceding and succeeding context representations. Attention mechanism is employed to give different focus to the information outputted from the hidden layers of BiLSTM. Finally, the softmax classifier is used to classify the processed context information. AC-BiLSTM is able to capture both the local feature of phrases as well as global sentence semantics. Experimental verifications are conducted on six sentiment classification datasets and a question classification dataset, including detailed analysis for AC-BiLSTM. The results clearly show that AC-BiLSTM outperforms other state-of-the-art text classification methods in terms of the classification accuracy.

Convolutional Long Short-term Memory for Long Length Document Classification

Database Systems for Advanced Applications

Long Document Classification From Local Word Glimpses via Recurrent Attention Learning.

A multiclass classification framework for document categorization

A text classification method based on a convolutional and bidirectional long short-term memory model

Hierarchical Multi-granularity Interaction Graph Convolutional Network for Long Document Classification

Local Bidirectional Long Short Term Memory for Text Classification

A Multi-feature Fusion Method with Attention Mechanism for Long Text Classification

Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification.

Multi-Timescale Long Short-Term Memory Neural Network for Modelling Sentences and Documents

A Long-Text Classification Method of Chinese News Based on BERT and CNN

Automatically Classifying Chinese Judgment Documents Using Character-Level Convolutional Neural Networks

A Hybrid Model Based on Convolutional Neural Network and Long Short-Term Memory for Multi-label Text Classification

Research of Text Classification Based on TF-IDF and CNN-LSTM

Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification

Deep Learning for Technical Document Classification

Chinese Text Classification Model Based on Deep Learning

A text classification network model combining machine learning and deep learning

Bidirectional LSTM with attention mechanism and convolutional layer for text classification