Abstract:In sentence modeling and classification, convolutional neural network approaches have recently achieved state-of-the-art results, but all such efforts process word vectors sequentially and neglect long-distance dependencies. To exploit both deep learning and linguistic structures, we propose a tree-based convolutional neural network model which exploit various long-distance relationships between words. Our model improves the sequential baselines on all three sentiment and question classification tasks, and achieves the highest published accuracy on TREC.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that in sentence modeling and classification, although the existing Convolutional Neural Network (CNN) methods have achieved state - of - the - art results, they only consider sequential information when processing word vectors and ignore long - distance dependencies. The author proposes a dependency - based convolutional method (Dependency - based Convolutional Neural Networks, DCNNs), which utilizes n - grams in the tree structure rather than surface n - grams, thereby exploiting non - local interactions between words, aiming to combine deep learning with linguistic structures and improve the model's ability to capture long - distance dependencies. Specifically, the paper mainly focuses on the following aspects: 1. **Capturing long - distance dependencies**: Traditional CNNs usually only consider continuous word sequences when processing text data, ignoring the possible long - distance dependencies between words. Such dependencies are crucial for understanding the true meaning of sentences, especially in tasks such as sentiment analysis, subjectivity analysis, and question classification. 2. **Combining linguistic structures**: By introducing the dependency tree structure, the model can better capture the structural information within the sentence, especially those long - distance dependencies that have an important impact on the sentence sentiment or category. 3. **Improving classification performance**: The author hopes that the proposed DCNN model can outperform the existing sequential - based CNN models in multiple classification tasks, especially in sentiment analysis and question classification tasks. The paper verifies the effectiveness of the DCNN model through experiments, especially its performance on the TREC dataset is better than all previously published results, including those methods using a large number of hand - designed features.

Dependency-based Convolutional Neural Networks for Sentence Embedding

Tree-based Convolution for Sentence Modeling.

Dependency-based Siamese Long Short-Term Memory Network for Learning Sentence Representations.

Dependency Sensitive Convolutional Neural Networks for Modeling Sentences and Documents

Discriminative Neural Sentence Modeling by Tree-Based Convolution

Tree-based convolution: A new architecture for sentence modeling

A syntactic dependency method for aspect-level sentiment classification by deep learning

A Hierarchical Model with Recurrent Convolutional Neural Networks for Sequential Sentence Classification

On Tree-Based Neural Sentence Modeling

Tree-Structured Neural Machine for Linguistics-Aware Sentence Generation

A Convolutional Neural Network for Modelling Sentences

Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention

A Long Dependency Aware Deep Architecture for Joint Chinese Word Segmentation and POS Tagging.

Combining Convolution and Recursive Neural Networks for Sentiment Analysis

An Efficient Cross-lingual Model for Sentence Classification Using Convolutional Neural Network.

Long Short-Term Memory with Dynamic Skip Connections.

Enhancing Semantic Representations of Bilingual Word Embeddings with Syntactic Dependencies

Graph-based Dependency Parsing with Bidirectional LSTM.

Modeling Structured Dependency Tree with Graph Convolutional Networks for Aspect-Level Sentiment Classification

BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings