Enhancing Transformer with Sememe Knowledge.

Yuhui Zhang,Chenghao Yang,Zhengping Zhou,Zhiyuan Liu
DOI: https://doi.org/10.18653/v1/2020.repl4nlp-1.21
2020-01-01
Abstract:While large-scale pretraining has achieved great success in many NLP tasks, it has not been fully studied whether external linguistic knowledge can improve data-driven models. In this work, we introduce sememe knowledge into Transformer and propose three sememe-enhanced Transformer models. Sememes, by linguistic definition, are the minimum semantic units of language, which can well represent implicit semantic meanings behind words. Our experiments demonstrate that introducing sememe knowledge into Transformer can consistently improve language modeling and downstream tasks. The adversarial test further demonstrates that sememe knowledge can substantially improve model robustness.
What problem does this paper attempt to address?