Abstract: Neural network-based approaches have become the driving force for Natural Language Processing (NLP) tasks. Conventionally, there are two mainstream neural architectures for NLP tasks: the recurrent neural network (RNN) and the convolutional neural network (ConvNet). RNNs are good at modeling long-term dependencies over input texts, but preclude parallel computation. ConvNets have no memory capability and must model sequential data as unordered features. Therefore, ConvNets fail to learn sequential dependencies over the input texts, but they can carry out highly efficient parallel computation. As each neural architecture, such as the RNN and the ConvNet, has its own pros and cons, integrating different architectures is assumed to enrich the semantic representation of texts and thus enhance the performance of NLP tasks. However, few investigations have explored the reconciliation of these seemingly incompatible architectures. To address this issue, we propose a hybrid architecture based on a novel hierarchical multi-granularity attention mechanism, named Multi-granularity Attention-based Hybrid Neural Network (MahNN). The attention mechanism assigns different weights to different parts of the input sequence to increase the computational efficiency and performance of neural models. In MahNN, two types of attention are introduced: syntactical attention and semantical attention. Syntactical attention computes the importance of syntactic elements (such as words or sentences) at the lower symbolic level, and semantical attention computes the importance of the embedded space dimensions corresponding to the upper latent semantics. We adopt text classification as an example to illustrate the ability of MahNN to understand texts. The experimental results on a variety of datasets demonstrate that MahNN outperforms most state-of-the-art methods for text classification.
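
The two attention types described in the abstract can be illustrated with a minimal sketch. It is not the MahNN implementation: the abstract does not specify the scoring functions, so the dot-product scoring, the `query` vector, the per-dimension `dim_scores`, and all function names below are assumptions chosen for illustration only. Syntactical attention here produces one weight per word, and semantical attention one weight per embedding dimension.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def syntactical_attention(word_vectors, query):
    """One weight per word: dot-product score against a (hypothetical) query."""
    scores = [sum(h * q for h, q in zip(vec, query)) for vec in word_vectors]
    return softmax(scores)


def semantical_attention(dim_scores):
    """One weight per embedding dimension, from (hypothetical) learned scores."""
    return softmax(dim_scores)


def attend(word_vectors, query, dim_scores):
    """Pool words with syntactical weights, then rescale dimensions
    with semantical weights. Returns (representation, word_w, dim_w)."""
    word_w = syntactical_attention(word_vectors, query)
    dim_w = semantical_attention(dim_scores)
    dims = len(word_vectors[0])
    # Weighted sum over words for each embedding dimension.
    pooled = [
        sum(w * vec[d] for w, vec in zip(word_w, word_vectors))
        for d in range(dims)
    ]
    # Rescale each dimension by its semantical weight.
    representation = [dw * p for dw, p in zip(dim_w, pooled)]
    return representation, word_w, dim_w


if __name__ == "__main__":
    # Three toy "word" vectors with two embedding dimensions each.
    words = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    rep, word_w, dim_w = attend(words, query=[1.0, 1.0], dim_scores=[0.0, 0.0])
    print("word weights:", word_w)
    print("dimension weights:", dim_w)
    print("representation:", rep)
```

In this toy input the third word has the highest dot product with the query, so it receives the largest syntactical weight; equal `dim_scores` give every dimension the same semantical weight. In a trained model both sets of scores would be learned rather than fixed by hand.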

Hierarchical and Bidirectional Joint Multi-Task Classifiers for Natural Language Understanding

Hierarchical Inter-Attention Network for Document Classification with Multi-Task Learning.

Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding.

Hierarchical Multi-Granularity Attention-Based Hybrid Neural Network for Text Classification.

Hierarchical Multilabel Text Classification Via Multitask Learning.

HAIN: Multi-label Classification with Hierarchical Attention-based Interaction Network for Multi-turn Dialogue Texts

Tri-level Joint Natural Language Understanding for Multi-turn Conversational Datasets

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

Pre-Trained Joint Model for Intent Classification and Slot Filling with Semantic Feature Fusion

12-in-1: Multi-Task Vision and Language Representation Learning

Multi-task Learning with Bidirectional Language Models for Text Classification

Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems

Deep collaborative multi-task network: A human decision process inspired model for hierarchical image classification

Multi-Task Learning for Front-End Text Processing in TTS

HirMTL: Hierarchical Multi-Task Learning for dense scene understanding

A cross modal hierarchical fusion multimodal sentiment analysis method based on multi-task learning

An Interactive Fusion Model for Hierarchical Multi-label Text Classification

Large Language Model as a Universal Clinical Multi-task Decoder

Hierarchical Multitask Learning for CTC-based Speech Recognition

Hierarchical multiples self-attention mechanism for multi-modal analysis

A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition