Abstract: Neural network-based approaches have become the driving force behind Natural Language Processing (NLP) tasks. Conventionally, there are two mainstream neural architectures for NLP: the recurrent neural network (RNN) and the convolutional neural network (ConvNet). RNNs are good at modeling long-term dependencies over input texts, but preclude parallel computation. ConvNets have no memory capability and must model sequential data as unordered features. ConvNets therefore fail to learn sequential dependencies over the input texts, but they can carry out highly efficient parallel computation. Because each architecture has its own pros and cons, integrating different architectures is assumed to enrich the semantic representation of texts and thus enhance performance on NLP tasks. However, few studies have explored how to reconcile these seemingly incompatible architectures. To address this issue, we propose a hybrid architecture based on a novel hierarchical multi-granularity attention mechanism, named Multi-granularity Attention-based Hybrid Neural Network (MahNN). The attention mechanism assigns different weights to different parts of the input sequence to increase the computational efficiency and performance of neural models. In MahNN, two types of attention are introduced: syntactical attention and semantical attention. Syntactical attention computes the importance of syntactic elements (such as words or sentences) at the lower symbolic level, while semantical attention computes the importance of the embedding-space dimensions corresponding to the upper latent semantics. We adopt text classification as an example to illustrate the ability of MahNN to understand texts. Experimental results on a variety of datasets demonstrate that MahNN outperforms most state-of-the-art methods for text classification.
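The two attention granularities named in the abstract can be illustrated with a small sketch. This is not the MahNN implementation: the abstract does not give the scoring functions, so the dot-product scoring, the function names (`word_attention`, `dimension_attention`, `attend`), and the query vectors `w` and `v` are all assumptions made for illustration. The sketch only shows the distinction between weighting the rows of a hidden-state matrix (one weight per word) and weighting its columns (one weight per embedding dimension).

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def word_attention(H, w):
    """Syntactical-style attention (assumed form): one weight per word.

    H is a T x D list of word vectors; w is a hypothetical query of length D.
    """
    scores = [sum(h_d * w_d for h_d, w_d in zip(row, w)) for row in H]
    return softmax(scores)


def dimension_attention(H, v):
    """Semantical-style attention (assumed form): one weight per dimension.

    v is a hypothetical query of length T, scored against each column of H.
    """
    columns = list(zip(*H))
    scores = [sum(c_t * v_t for c_t, v_t in zip(col, v)) for col in columns]
    return softmax(scores)


def attend(H, w, v):
    """Combine both weightings into a single text vector of length D."""
    alpha = word_attention(H, w)       # weights over words
    beta = dimension_attention(H, v)   # weights over embedding dimensions
    dims = len(H[0])
    return [
        sum(alpha[t] * beta[d] * H[t][d] for t in range(len(H)))
        for d in range(dims)
    ]


# Toy input: three words, two embedding dimensions.
H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
w = [0.5, -0.5]
v = [1.0, 0.0, 0.0]
text_vector = attend(H, w, v)
```

In this sketch each set of weights sums to one, so the word weights and the dimension weights can be read as two separate importance distributions over the same matrix.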

Global-Local Mutual Attention Model for Text Classification

Hierarchical Multi-Granularity Attention-Based Hybrid Neural Network for Text Classification

Multi-Label Text Classification Model Integrating Label Attention and Historical Attention

Global-local graph attention: unifying global and local attention for node classification

Integration of Global and Local Information for Text Classification

Label-Attentive Hierarchical Attention Network for Text Classification

DCCL: Dual-channel hybrid neural network combined with self-attention for text classification

MLGN: A Multi-Label Guided Network for Improving Text Classification

Optimizing Automatic Text Classification Approach in Adaptive Online Collaborative Discussion–A Perspective of Attention Mechanism-Based Bi-LSTM

Learn More from Context: Joint Modeling of Local and Global Attention for Aspect Sentiment Classification

GLA: Global–Local Attention for Image Description

Multi-label text classification based on semantic-sensitive graph convolutional network

Addressing challenges in multi-label text classification: A joint attention and shared semantic space approach

Multichannel CNN with Attention for Text Classification

Coarse to Fine: Multi-label Image Classification with Global/Local Attention

A Novel Method Using Local Feature to Enhance GCN for Text Classification

Text Classification with Attention Gated Graph Neural Network

A Multi-Classification Sentiment Analysis Model of Chinese Short Text Based on Gated Linear Units and Attention Mechanism

Research of multi-label text classification based on label attention and correlation networks

Novel GCN Model Using Dense Connection and Attention Mechanism for Text Classification

Global Meets Local: Effective Multi-Label Image Classification Via Category-Aware Weak Supervision