Chinese Embedding via Stroke and Glyph Information: A Dual-channel View

Hanqing Tao,Shiwei Tong,Tong Xu,Qi Liu,Enhong Chen
DOI: https://doi.org/10.48550/arXiv.1906.04287
2019-06-03
Computation and Language
Abstract:Recent studies have consistently given positive hints that morphology is helpful in enriching word embeddings. In this paper, we argue that Chinese word embeddings can be substantially enriched by the morphological information hidden in characters which is reflected not only in strokes order sequentially, but also in character glyphs spatially. Then, we propose a novel Dual-channel Word Embedding (DWE) model to realize the joint learning of sequential and spatial information of characters. Through the evaluation on both word similarity and word analogy tasks, our model shows its rationality and superiority in modelling the morphology of Chinese.
What problem does this paper attempt to address?