Abstract: In this paper, we propose a sequential neural encoder with latent structured description (SNELSD) for modeling sentences. This model introduces latent chunk-level representations into conventional sequential neural encoders, i.e., recurrent neural networks with long short-term memory (LSTM) units, to account for the compositionality of language in semantic modeling. An SNELSD model has a hierarchical structure that includes a detection layer and a description layer. The detection layer predicts the boundaries of latent word chunks in an input sentence and derives a chunk-level vector for each word. The description layer uses modified LSTM units to process these chunk-level vectors recurrently and produces sequential encoding outputs. These output vectors are then concatenated with word vectors or with the outputs of a chain LSTM encoder to obtain the final sentence representation. All model parameters are learned end to end, without depending on additional text chunking or syntax parsing. A natural language inference task and a sentiment analysis task are used to evaluate the proposed model. The experimental results demonstrate the effectiveness of the SNELSD model in exploring task-dependent chunking patterns during the semantic modeling of sentences. Furthermore, the proposed method achieves better performance than conventional chain LSTMs and tree-structured LSTMs on both tasks.
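
The abstract does not give the equations of the detection layer, so the following is only an illustrative sketch of the general idea of deriving one chunk-level vector per word from predicted chunk boundaries. It assumes that each word comes with a probability of opening a new chunk and that a chunk is summarized by a running mean of its word vectors; the function name `chunk_level_vectors`, the running-mean pooling, and the boundary probabilities are all hypothetical and are not taken from the paper.

```python
from typing import List

Vector = List[float]


def chunk_level_vectors(words: List[Vector], boundary_probs: List[float]) -> List[Vector]:
    """Return one chunk-level vector per word (illustrative running mean).

    boundary_probs[t] is the assumed probability that word t opens a new chunk.
    With hard 0/1 values, each output is the mean of the words seen so far
    in the current chunk. This is an assumption, not the paper's formulation.
    """
    if len(words) != len(boundary_probs):
        raise ValueError("need one boundary probability per word")
    out: List[Vector] = []
    prev_vec: Vector = []
    prev_count = 0.0
    for x, b in zip(words, boundary_probs):
        if not out:
            b = 1.0  # the first word always opens a chunk
        # Expected number of words in the current chunk after adding this word.
        count = b * 1.0 + (1.0 - b) * (prev_count + 1.0)
        if out:
            # Vector obtained if the previous chunk simply continues.
            cont = [(prev_count * p + xi) / (prev_count + 1.0)
                    for p, xi in zip(prev_vec, x)]
        else:
            cont = list(x)
        # Blend "start a new chunk" (x alone) with "continue the chunk" (cont).
        vec = [b * xi + (1.0 - b) * ci for xi, ci in zip(x, cont)]
        out.append(vec)
        prev_vec, prev_count = vec, count
    return out


if __name__ == "__main__":
    words = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0]]
    probs = [1.0, 0.0, 1.0, 0.0]  # two chunks of two words each
    print(chunk_level_vectors(words, probs))
    # [[1.0, 0.0], [2.0, 0.0], [0.0, 2.0], [0.0, 3.0]]
```

In a trained model the boundary probabilities would be produced by the network itself, and the resulting vectors would be passed on to a recurrent description layer; neither step is modeled here.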

Phrase-level Self-Attention Networks for Universal Sentence Encoding

Deep Attentive Sentence Ordering Network

SAC: Accelerating and Structuring Self-Attention Via Sparse Adaptive Connection

A Window-Based Self-Attention Approach for Sentence Encoding

Multiple Structural Priors Guided Self Attention Network for Language Understanding

Deps-SAN: Neural Machine Translation with Dependency-Scaled Self-Attention Network

Boosting Neural Machine Translation with Dependency-Scaled Self-Attention Network

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Multi-Granularity Self-Attention for Neural Machine Translation

Phrase-Level Global-Local Hybrid Model for Sentence Embedding

Self-Attention with Cross-Lingual Position Representation

S2SAN: A sentence-to-sentence attention network for sentiment analysis of online reviews

SG-Net: Syntax Guided Transformer for Language Representation

DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding

Structured Self-Attention Weights Encode Semantics in Sentiment Analysis

Encoding Syntactic Knowledge in Neural Networks for Sentiment Classification

Learning Universal Sentence Representations with Mean-Max Attention Autoencoder

SAAN: A Sentiment-Aware Attention Network for Sentiment Analysis

How Does Selective Mechanism Improve Self-Attention Networks?

A Sequential Neural Encoder with Latent Structured Description for Modeling Sentences

Multimodal Semantic Attention Network for Video Captioning