Abstract:It is widely accepted that traditional word embedding models, which rely on distributional semantics hypothesis, are relatively limited for contrast meaning problem. Distributional semantics hypothesis indicates that words lying in similar contexts have similar representations in vector space. Nevertheless, synonyms and antonyms often locate in similar contexts, which means they appear close to each other in vector space. Hence, it is of great difficulty to distinguish antonyms from synonyms. To address this challenge, we propose an optimization model, named Lexicon-based Word Embedding Tuning (LWET) model. The goal of LWET is to incorporate reliable semantic lexicons to tune the distributions of pre-trained word embeddings in the vector space so as to improve their ability of distinguishing antonyms from synonyms. To speed up the training process of LWET, we propose two approximation algorithms, including positive sampling and quasi-hierarchical softmax. Compared with quasi-hierarchical softmax, positive sampling is faster, however, at the cost of worse performance. In experiments, LWET and other state-of-the-art models are tested on antonyms recognition, distinguishing antonyms from synonyms and word similarity. The results of the first two experiments show that LWET significantly improves the ability of word embeddings to detect antonyms, thus achieving the state-of-the-art performance. On word similarity, LWET gets slightly better performance than the state-of-the-art models. It means that LWET can remain and strengthen the semantic structure rather than destroy it when tuning word distributions in vector space. In general, compared with related work, LWET can not only achieve similar or even better performance, but also speed up the training process.

Using a Chinese Lexicon to Learn Sense Embeddings and Measure Semantic Similarity.

Do Multi-Sense Embeddings Improve Natural Language Understanding?

Chinese Word Sense Embedding with SememeWSD and Synonym Set

Learning Sense-specific Word Embeddings By Exploiting Bilingual Resources.

Learning Word Sense Embeddings from Word Sense Definitions

Leveraging Human Prior Knowledge to Learn Sense Representations

Revisit Word Embeddings with Semantic Lexicons for Modeling Lexical Contrast

Not All Synonyms Are Created Equal: Incorporating Similarity of Synonyms to Enhance Word Embeddings

Improved Learning of Chinese Word Embeddings with Semantic Knowledge.

Real Multi-Sense or Pseudo Multi-Sense: an Approach to Improve Word Representation

Multi-phase Word Sense Embedding Learning Using a Corpus and a Lexical Ontology.

Multi-phase Word Sense Embedding Retrofitting with Lexical Ontology

Measuring Word Polysemousness And Sense Granularity At A Language Level

A Local Information Perception Enhancement–Based Method for Chinese NER

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context

PolyLM: Learning about Polysemy through Language Modeling

Addressing the Polysemy Problem in Language Modeling with Attentional Multi-Sense Embeddings

Chinese LIWC Lexicon Expansion Via Hierarchical Classification of Word Embeddings with Sememe Attention

Improved Word Representation Learning with Sememes

Evaluation of taxonomic and neural embedding methods for calculating semantic similarity

Together We Make Sense -- Learning Meta-Sense Embeddings from Pretrained Static Sense Embeddings