Global-locality Preserving Projection for Word Embedding

Wang Bolin,Sun Yuanyuan,Chu Yonghe,Yang Zhihao,Lin Hongfei
DOI: https://doi.org/10.1007/s13042-022-01574-y
2022-01-01
International Journal of Machine Learning and Cybernetics
Abstract:Pre-trained word embedding has a significant impact on constructing representations for sentences, paragraphs and documents. However, existing word embedding methods are typically learned in the Euclidean space. Distributed word embedding suffers from inaccurate semantic similarity and high computational cost in the Euclidean metric space. In this study, we propose global-locality preserving projection to refine word representation by re-embedding word vectors from the original embedding space to a manifold semantic space. Our method extracts the local feature of the word vector and preserves the global feature of the word vector as well. It can discover the local geometric structure that also indicates the latent semantic structure and obtain a compact word embedding subspace. The performance of the method is assessed on several lexical-level intrinsic tasks of semantic similarity and semantic relatedness, and the experimental results demonstrate its advantages over other word embedding-based methods.
What problem does this paper attempt to address?