Unsupervised Learning of Sentence Embeddings using Compositional n-Gram Features

Matteo Pagliardini,Prakhar Gupta,Martin Jaggi
DOI: https://doi.org/10.18653/v1/N18-1049
2018-12-28
Abstract:The recent tremendous success of unsupervised word embeddings in a multitude of applications raises the obvious question if similar methods could be derived to improve embeddings (i.e. semantic representations) of word sequences as well. We present a simple but efficient unsupervised objective to train distributed representations of sentences. Our method outperforms the state-of-the-art unsupervised models on most benchmark tasks, highlighting the robustness of the produced general-purpose sentence embeddings.
Computation and Language,Artificial Intelligence,Information Retrieval
What problem does this paper attempt to address?