Abstract:Distributed word representations have been widely used and proven to be useful in quite a few natural language processing and text mining tasks. Most of existing word embedding models aim at generating only one embedding vector for each individual word, which, however, limits their effectiveness because huge amounts of words are polysemous (such as bank and star). To address this problem, it is necessary to build multi embedding vectors to represent different meanings of a word respectively. Some recent studies attempted to train multi-prototype word embeddings through clustering context window features of the word. However, due to a large number of parameters to train, these methods yield limited scalability and are inefficient to be trained with big data. In this paper, we introduce a much more efficient method for learning multi embedding vectors for polysemous words. In particular, we first propose to model word polysemy from a probabilistic perspective and integrate it with the highly efficient continuous Skip-Gram model. Under this framework, we design an Expectation-Maximization algorithm to learn the word’s multi embedding vectors. With much less parameters to train, our model can achieve comparable or even better results on word-similarity tasks compared with conventional methods.

Learning Ordered Word Representations

Learning Ordered Word Representations with Γ-Decay Dropout

Enhanced Double-Carrier Word Embedding Via Phonetics and Writing

Learning Word Embedding with Better Distance Weighting and Window Size Scheduling

WordRank: Learning Word Embeddings via Robust Ranking

Learning Word Sense Embeddings from Word Sense Definitions

Category Enhanced Word Embedding.

Ordered and Binary Speaker Embedding

Learning word embeddings from dependency relations

Learning Chinese Word Embeddings from Stroke, Structure and Pinyin of Characters

Learning Context-Specific Word/Character Embeddings.

Visual Exploration and Comparison of Word Embeddings.

Learning Word Representations with Hierarchical Sparse Coding

Delta Embedding Learning

Investigating Language Universal and Specific Properties in Word Embeddings

Learning Word Embeddings from Intrinsic and Extrinsic Views

Word Embedding Revisited: A New Representation Learning and Explicit Matrix Factorization Perspective.

Learning Sparse Overcomplete Word Vectors Without Intermediate Dense Representations

Learning word representation by jointly using neighbor and syntactic contexts

A Probabilistic Model for Learning Multi-Prototype Word Embeddings.

Learning Effective Word Embedding Using Morphological Word Similarity