Abstract:Textual representations play an important role in the field of natural language processing (NLP). The efficiency of NLP tasks, such as text comprehension and information extraction, can be significantly improved with proper textual representations. As neural networks are gradually applied to learn the representation of words and phrases, fairly efficient models of learning short text representations have been developed, such as the continuous bag of words (CBOW) and skip-gram models, and they have been extensively employed in a variety of NLP tasks. Because of the complex structure generated by the longer text lengths, such as sentences, algorithms appropriate for learning short textual representations are not applicable for learning long textual representations. One method of learning long textual representations is the Long Short-Term Memory (LSTM) network, which is suitable for processing sequences. However, the standard LSTM does not adequately address the primary sentence structure (subject, predicate and object), which is an important factor for producing appropriate sentence representations. To resolve this issue, this paper proposes the dependency-based LSTM model (D-LSTM). The D-LSTM divides a sentence representation into two parts: a basic component and a supporting component. The D-LSTM uses a pre-trained dependency parser to obtain the primary sentence information and generate supporting components, and it also uses a standard LSTM model to generate the basic sentence components. A weight factor that can adjust the ratio of the basic and supporting components in a sentence is introduced to generate the sentence representation. Compared with the representation learned by the standard LSTM, the sentence representation learned by the D-LSTM contains a greater amount of useful information. The experimental results show that the D-LSTM is superior to the standard LSTM for sentences involving compositional knowledge (SICK) data.

Learning Sparse Hidden States In Long Short-Term Memory

Dependency-based Siamese Long Short-Term Memory Network for Learning Sentence Representations.

Power Load Prediction Model Based on Long Short Term Memory and Sparrow Search Algorithm

Image Captioning with Sparse LTSM

SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models

ELSTM: An improved long short‐term memory network language model for sequence learning

NEWLSTM: an Optimized Long Short-Term Memory Language Model for Sequence Prediction.

Deep Learning with Long Short-Term Memory for Time Series Prediction

A Brain-Inspired Spiking Neural Network Model with Temporal Encoding and Learning

Learning Longer Memory in Recurrent Neural Networks

LSTM-SNP: A long short-term memory model inspired from spiking neural P systems

Learning to Forget: Continual Prediction with LSTM

E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System

A Hybrid Spiking Neurons Embedded LSTM Network for Multivariate Time Series Learning under Concept-Drift Environment

SPikE-SSM: A Sparse, Precise, and Efficient Spiking State Space Model for Long Sequences Learning

Learning Low-Rank Structured Sparsity in Recurrent Neural Networks

Learning Long Sequences in Spiking Neural Networks

Data-Driven Predictive Modeling of Neuronal Dynamics using Long Short-Term Memory

A survey on long short-term memory networks for time series prediction

Coupling LSTM neural networks and state-space models through analytically tractable inference