Abstract:Current evidence indicates that the semantic representation of question and answer sentences is better generated by deep neural network-based sentence models than traditional methods in community answer selection tasks. In particular, as a widely recognized language model, the self-attention model computes the similarity between the specific word and the whole sets of words in the same sentence and generates new semantic representation through the similarity-weighted summation of semantic representations of the whole words. However, the self-attention operation entirely considers all the signals with a weighted sum operation, which disperses the distribution of attention, which may result in overlooking the relation of neighboring signals. This issue becomes serious when applying the self-attention model to online community question answering platforms because of the varied length of the user-generated questions and answers. To address this problem, we introduce an attention mechanism enhanced local self-attention (LSA), which restricts the range of original self-attention by a local window mechanism, thereby scaling linearly when increasing the sequence length. Furthermore, we propose stacking multiple LSA layers to model the relationship of multiscale $n$ -gram features. It captures the word-to-word relationship in the first layer and then captures the chunk-to-chunk (such as lexical $n$ -gram phrases) relationship in its deeper layers. We also test the effectiveness of the proposed model by applying the learned representation through the LSA model to a Siamese and a classification network in community question answer selection tasks. Experiments on the public datasets show that the proposed LSA achieves a good performance.

Attention Boosted Sequential Inference Model

Residual Connected Enhanced Sequential Inference Model for Natural Language Inference

Syntax-based Attention Model for Natural Language Inference.

Research on Attention Memory Networks As a Model for Learning Natural Language Inference.

Building Sequential Inference Models for End-to-End Response Selection.

Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention.

Attention-Fused Deep Matching Network for Natural Language Inference

Natural Language Inference Using Lstm Model With Sentence Fusion

Collaborative Attention Network for Natural Language Inference

Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference.

Syntax-Aware Attention for Natural Language Inference with Phrase-Level Matching

SEASum: Syntax-Enriched Abstractive Summarization

Modeling Selective Feature Attention for Representation-based Siamese Text Matching

Improving Self-Attention Networks with Sequential Relations

Enhanced soft attention mechanism with an inception-like module for image captioning

Double Attention Mechanism for Sentence Embedding.

A Local Self-Attention Sentence Model for Answer Selection Task in CQA Systems

Seq2seq Attentional Siamese Neural Networks For Text-Dependent Speaker Verification

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

A Cognition Based Attention Model for Sentiment Analysis.

Enhanced Lstm For Natural Language Inference