Abstract:Current evidence indicates that the semantic representation of question and answer sentences is better generated by deep neural network-based sentence models than traditional methods in community answer selection tasks. In particular, as a widely recognized language model, the self-attention model computes the similarity between the specific word and the whole sets of words in the same sentence and generates new semantic representation through the similarity-weighted summation of semantic representations of the whole words. However, the self-attention operation entirely considers all the signals with a weighted sum operation, which disperses the distribution of attention, which may result in overlooking the relation of neighboring signals. This issue becomes serious when applying the self-attention model to online community question answering platforms because of the varied length of the user-generated questions and answers. To address this problem, we introduce an attention mechanism enhanced local self-attention (LSA), which restricts the range of original self-attention by a local window mechanism, thereby scaling linearly when increasing the sequence length. Furthermore, we propose stacking multiple LSA layers to model the relationship of multiscale $n$ -gram features. It captures the word-to-word relationship in the first layer and then captures the chunk-to-chunk (such as lexical $n$ -gram phrases) relationship in its deeper layers. We also test the effectiveness of the proposed model by applying the learned representation through the LSA model to a Siamese and a classification network in community question answer selection tasks. Experiments on the public datasets show that the proposed LSA achieves a good performance.

A Span-based Dynamic Local Attention Model for Sequential Sentence Classification.

Jointly Trained Sequential Labeling and Classification by Sparse Attention Neural Networks.

Cascaded Semantic and Positional Self-Attention Network for Document Classification

A Hierarchical Model with Recurrent Convolutional Neural Networks for Sequential Sentence Classification

A Context-focused Attention Evolution Model for Aspect-based Sentiment Classification

Sentiment classification using bidirectional LSTM-SNP model and attention mechanism

Span-based joint entity and relation extraction augmented with sequence tagging mechanism

Optimizing Automatic Text Classification Approach in Adaptive Online Collaborative Discussion–A Perspective of Attention Mechanism-Based Bi-LSTM

Boundary Enhanced Neural Span Classification for Nested Named Entity Recognition

Span-based Joint Entity and Relation Extraction with Attention-based Span-specific and Contextual Semantic Representations.

A Lexicon-Based Supervised Attention Model for Neural Sentiment Analysis.

A span-based model for aspect terms extraction and aspect sentiment classification

Boundary-Aware Dual Biaffine Model for Sequential Sentence Classification in Biomedical Documents.

Learning Latent Opinions for Aspect-level Sentiment Classification

Aspect-level Sentiment Classification Based on Attention-Bilstm Model and Transfer Learning

Dual-axial Self-Attention Network for Text Classification

Aspect-level Sentiment Classification with Multi-head-attention-based Multi-channel Graph Convolutional Networks.

Joint Learning of Local and Global Features for Aspect-based Sentiment Classification

Effective Strategies for Combining Attention Mechanism with LSTM for Aspect-Level Sentiment Classification

A Local Self-Attention Sentence Model for Answer Selection Task in CQA Systems

Boosting Span-based Joint Entity and Relation Extraction via Squence Tagging Mechanism