Learning Multi-Prototype Word Embedding from Single-Prototype Word Embedding with Integrated Knowledge.

Xuefeng Yang,Kezhi Mao
DOI: https://doi.org/10.1016/j.eswa.2016.03.013
IF: 8.5
2016-01-01
Expert Systems with Applications
Abstract:Distributional semantic models (DSM) or word embeddings are widely used in prediction of semantic similarity and relatedness. However, context aware similarity and relatedness prediction is still a challenging issue because most DSM models or word embeddings use one vector per word without considering polysemy and homonym. In this paper, we propose a supervised fine tuning framework to transform the existing single-prototype word embeddings into multi-prototype word embeddings based on lexical semantic resources. As a post-processing step, the proposed framework is compatible with any sense inventory and any word embedding. To test the proposed learning framework, both intrinsic and extrinsic evaluations are conducted. Experiments results of 3 tasks with 8 datasets show that the multi-prototype word representations learned by the proposed framework outperform single-prototype word representations. (C) 2016 Elsevier Ltd. All rights reserved.
What problem does this paper attempt to address?