Abstract: Textual representations play an important role in natural language processing (NLP). Proper textual representations can significantly improve the efficiency of NLP tasks such as text comprehension and information extraction. As neural networks have increasingly been applied to learning representations of words and phrases, fairly efficient models for learning short text representations have been developed, such as the continuous bag of words (CBOW) and skip-gram models, and these have been employed extensively in a variety of NLP tasks. Longer texts, such as sentences, have a more complex structure, so algorithms suited to learning short textual representations are not applicable to learning long textual representations. One method of learning long textual representations is the Long Short-Term Memory (LSTM) network, which is suitable for processing sequences. However, the standard LSTM does not adequately capture the primary sentence structure (subject, predicate and object), which is an important factor in producing appropriate sentence representations. To resolve this issue, this paper proposes the dependency-based LSTM model (D-LSTM). The D-LSTM divides a sentence representation into two parts: a basic component and a supporting component. It uses a pre-trained dependency parser to obtain the primary sentence information and generate the supporting component, and a standard LSTM model to generate the basic component. A weight factor that adjusts the ratio of the basic and supporting components is introduced to generate the final sentence representation. Compared with the representation learned by the standard LSTM, the sentence representation learned by the D-LSTM contains more useful information. The experimental results show that the D-LSTM is superior to the standard LSTM on the Sentences Involving Compositional Knowledge (SICK) dataset.
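The two-component idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything below is an assumption made for illustration: the convex combination `(1 - weight) * basic + weight * supporting`, the function names `select_core_tokens` and `combine_components`, and the use of the dependency labels `nsubj`, `root`, and `dobj` as stand-ins for subject, predicate, and object. In the paper the basic component comes from a standard LSTM and the parse comes from a pre-trained dependency parser; here both are replaced by hand-written toy inputs.

```python
from typing import List, Sequence, Tuple

# Assumed label set: Stanford/Universal Dependencies style relations
# standing in for subject, predicate, and object.
CORE_RELATIONS = {"nsubj", "root", "dobj"}


def select_core_tokens(parsed: Sequence[Tuple[str, str]]) -> List[str]:
    """Keep tokens whose dependency relation marks subject, predicate, or object."""
    return [token for token, relation in parsed if relation in CORE_RELATIONS]


def combine_components(basic: Sequence[float],
                       supporting: Sequence[float],
                       weight: float) -> List[float]:
    """Blend two equal-length vectors; the convex form is an assumption."""
    if len(basic) != len(supporting):
        raise ValueError("components must have the same dimension")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return [(1.0 - weight) * b + weight * s for b, s in zip(basic, supporting)]


# Toy (token, relation) pairs, written by hand instead of produced by a parser.
parsed_sentence = [("A", "det"), ("dog", "nsubj"), ("chases", "root"),
                   ("the", "det"), ("ball", "dobj")]
core = select_core_tokens(parsed_sentence)

# Toy 2-dimensional vectors standing in for the LSTM outputs.
sentence_vector = combine_components([1.0, 0.0], [0.0, 1.0], 0.25)
```

With `weight` set to 0 the representation reduces to the basic component alone, which corresponds to the standard LSTM baseline in this sketch; larger values give the subject, predicate, and object information more influence.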

Definition Extraction with LSTM Recurrent Neural Networks

Dependency-based Siamese Long Short-Term Memory Network for Learning Sentence Representations

A Joint Model for Definition Extraction with Syntactic Connection and Semantic Consistency

Automated Discovery of Mathematical Definitions in Text with Deep Neural Networks

Definition Modeling: Learning to define word embeddings in natural language

LSTM-in-LSTM for Generating Long Descriptions of Images

Cross-Sentence N-ary Relation Extraction with Graph LSTMs

Automatic Document Metadata Extraction Based on Deep Networks

Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks

An Open-Domain Event Extraction Method Incorporating Semantic and Dependent Syntactic Information

Named Entity Recognition with Bidirectional LSTM-CNNs

Entity Relationship Extraction Based on Bi-LSTM and Attention Mechanism

Top-down Tree Long Short-Term Memory Networks

xSense: Learning Sense-Separated Sparse Representations and Textual Definitions for Explainable Word Sense Networks

End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures

A Survey Deep Learning Based Relation Extraction

Explicit Semantic Decomposition for Definition Generation

Deep Residual Learning for Weakly-Supervised Relation Extraction

Fine-grained Contrastive Learning for Definition Generation

Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks

Joint Extraction of Opinion Targets and Opinion Words Based on LSTM