Abstract: Integrating lexical information into Chinese character embeddings is an effective way to address Chinese named entity recognition (NER). However, most existing methods focus only on discovering named entity boundaries and consider only the words matched by each Chinese character. They ignore the association between a character and its left- and right-matching words, as well as the local semantic information in the character's neighborhood, which is crucial for Chinese NER. Chinese contains a large number of polysemous words, that is, single words with multiple meanings. Without sufficient contextual information, the intended meaning of a text can therefore be hard to determine, which gives rise to ambiguity. We consider how to handle, more simply and effectively, the entity ambiguity that polysemous words cause in Chinese texts across different contexts. We propose using graph attention networks to construct relations between matching words and neighboring characters, as well as among matching words, and to add left- and right-matching words directly using the semantic information provided by the local lexicon. We also propose a short-sequence convolutional neural network (SSCNN), which uses shorter subsequences generated and encoded by a sliding window module to enhance the perception of local information about each character. Compared with widely used Chinese NER models, our approach achieves improvements of 1.18%, 0.29%, 0.18%, and 1.1% on the four benchmark datasets Weibo, Resume, OntoNotes, and E-commerce, respectively, demonstrating the effectiveness of the model.
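
The sliding-window idea mentioned in the abstract can be illustrated with a minimal sketch. It only shows how a character sequence might be cut into short, overlapping subsequences centered on each character; it is not the SSCNN itself, and the abstract does not specify the window size, padding scheme, or encoder. The function name `sliding_subsequences`, the `<pad>` token, and the window size of 3 are all assumptions made for illustration.

```python
def sliding_subsequences(chars, window=3, pad="<pad>"):
    """Return one short subsequence per character, centered on that character.

    Hypothetical helper: the window size and the padding token are
    illustrative assumptions, not values taken from the paper.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    half = window // 2
    # Pad both ends so that boundary characters also get a full window.
    padded = [pad] * half + list(chars) + [pad] * half
    return [padded[i:i + window] for i in range(len(chars))]


# Example sentence chosen for illustration only.
subs = sliding_subsequences("南京市长江大桥", window=3)
for center, sub in zip("南京市长江大桥", subs):
    print(center, sub)
```

Each subsequence could then be passed to a local encoder such as a small convolution, so that the representation of a character reflects its immediate neighborhood rather than the whole sentence.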

Sememe Tree Prediction for English-Chinese Word Pairs.

Predicting Categorial Sememe for English-Chinese Word Pairs via Representations in Explainable Sememe Space.

Chinese Lexical Sememe Prediction Using CilinE Knowledge

Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets

Sememe Prediction for BabelNet Synsets Using Multilingual and Multimodal Information

Incorporating Chinese Characters of Words for Lexical Sememe Prediction

A Sememe Prediction Method Based on the Central Word of a Semantic Field

Try to Substitute: an Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet.

Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models

Lexical Sememe Prediction using Dictionary Definitions by Capturing Local Semantic Correspondence

Lexical Sememe Prediction Via Word Embeddings and Matrix Factorization.

Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions.

Sememe Knowledge Computation: a Review of Recent Advances in Application and Expansion of Sememe Knowledge Bases

Cross-lingual Lexical Sememe Prediction

Chinese Word Sense Embedding with SememeWSD and Synonym Set

Going "Deeper": Structured Sememe Prediction via Transformer with Tree Attention

A Local Information Perception Enhancement–Based Method for Chinese NER

SememeLM: A Sememe Knowledge Enhanced Method for Long-tail Relation Representation

Automatic Construction of Sememe Knowledge Bases via Dictionaries.

Chinese LIWC Lexicon Expansion Via Hierarchical Classification of Word Embeddings with Sememe Attention

Chinese Word Similarity Computing Based on Semantic Tree