Chinese Lexical Sememe Prediction Using CilinE Knowledge

Hao Wang, Sirui Liu, Jianyong Duan, Li He, Xin Li
DOI: https://doi.org/10.1587/transfun.2022eap1074
2023-01-01
Abstract: Sememes are the smallest semantic units of human languages, the composition of which can represent the meaning of words. Sememes have been successfully applied to many downstream applications in the field of natural language processing (NLP). Annotating a word's sememes depends on language experts, which is both time-consuming and labor-intensive, limiting the large-scale application of sememes. Researchers have proposed several methods to predict sememes for words automatically. However, existing sememe prediction methods focus on information about the word itself, ignoring expert-annotated knowledge bases, which indicate the relations between words and should be valuable for sememe prediction. Therefore, we aim to incorporate expert-annotated knowledge bases into the sememe prediction process. To achieve this, we propose a CilinE-guided sememe prediction model, which employs an existing word knowledge base, CilinE, to remodel sememe prediction from a relational perspective. Experiments on HowNet, a widely used Chinese sememe knowledge base, show that CilinE has an obvious positive effect on sememe prediction. Furthermore, our proposed method can be integrated into existing methods and significantly improve their prediction performance. We will release the data and code to the public.