Abstract:Textual representations play an important role in the field of natural language processing (NLP). The efficiency of NLP tasks, such as text comprehension and information extraction, can be significantly improved with proper textual representations. As neural networks are gradually applied to learn the representation of words and phrases, fairly efficient models of learning short text representations have been developed, such as the continuous bag of words (CBOW) and skip-gram models, and they have been extensively employed in a variety of NLP tasks. Because of the complex structure generated by the longer text lengths, such as sentences, algorithms appropriate for learning short textual representations are not applicable for learning long textual representations. One method of learning long textual representations is the Long Short-Term Memory (LSTM) network, which is suitable for processing sequences. However, the standard LSTM does not adequately address the primary sentence structure (subject, predicate and object), which is an important factor for producing appropriate sentence representations. To resolve this issue, this paper proposes the dependency-based LSTM model (D-LSTM). The D-LSTM divides a sentence representation into two parts: a basic component and a supporting component. The D-LSTM uses a pre-trained dependency parser to obtain the primary sentence information and generate supporting components, and it also uses a standard LSTM model to generate the basic sentence components. A weight factor that can adjust the ratio of the basic and supporting components in a sentence is introduced to generate the sentence representation. Compared with the representation learned by the standard LSTM, the sentence representation learned by the D-LSTM contains a greater amount of useful information. The experimental results show that the D-LSTM is superior to the standard LSTM for sentences involving compositional knowledge (SICK) data.

MTLAN: Multi-Task Learning and Auxiliary Network for Enhanced Sentence Embedding

Dependency-based Siamese Long Short-Term Memory Network for Learning Sentence Representations.

Learning Multilingual Sentence Embeddings From Monolingual Corpus

Unsupervised Cross-Lingual Sentence Representation Learning via Linguistic Isomorphism

CMLM-CSE: Based on Conditional MLM Contrastive Learning for Sentence Embeddings

Improving Multi-lingual Alignment Through Soft Contrastive Learning

Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization

Exploring Multilingual Syntactic Sentence Representations

FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding.

Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding

EMS: Efficient and Effective Massively Multilingual Sentence Embedding Learning

Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax

Revisiting Language Encoding in Learning Multilingual Representations

Improving Multilingual Semantic Textual Similarity with Shared Sentence Encoder for Low-resource Languages

Lightweight Cross-Lingual Sentence Representation Learning

Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings

An Unsupervised Sentence Embedding Method Bymutual Information Maximization.

Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word Alignment

Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention