Abstract:Integrating lexical information into Chinese character embedding is a valid method to figure out the Chinese named entity recognition (NER) issue. However, most existing methods focus only on the discovery of named entity boundaries, considering only the words matched by the Chinese characters. They ignore the association between Chinese characters and their left and right matching words. They ignore the local semantic information of the character’s neighborhood, which is crucial for Chinese NER. The Chinese language incorporates a significant number of polysemous words, meaning that a single word can possess multiple meanings. Consequently, in the absence of sufficient contextual information, individuals may encounter difficulties in comprehending the intended meaning of a text, leading to the emergence of ambiguity. We consider how to handle the issue of entity ambiguity because of polysemous words in Chinese texts in different contexts more simply and effectively. We propose in this paper the use of graph attention networks to construct relatives among matching words and neighboring characters as well as matching words and adding left- and right-matching words directly using semantic information provided by the local lexicon. Moreover, this paper proposes a short-sequence convolutional neural network (SSCNN). It utilizes the generated shorter subsequence encoded with the sliding window module to enhance the perception of local information about the character. Compared with the widely used Chinese NER models, our approach achieves 1.18%, 0.29%, 0.18%, and 1.1% improvement on the four benchmark datasets Weibo, Resume, OntoNotes, and E-commerce, respectively, and proves the effectiveness of the model.

Learning Sense-specific Word Embeddings By Exploiting Bilingual Resources.

Do Multi-Sense Embeddings Improve Natural Language Understanding?

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context

Learning Word Sense Embeddings from Word Sense Definitions

A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation

A Local Information Perception Enhancement–Based Method for Chinese NER

Multi-phase Word Sense Embedding Learning Using a Corpus and a Lexical Ontology.

Leveraging Human Prior Knowledge to Learn Sense Representations

Multi-phase Word Sense Embedding Retrofitting with Lexical Ontology

Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation

Together We Make Sense -- Learning Meta-Sense Embeddings from Pretrained Static Sense Embeddings

Learning Context-Sensitive Word Embeddings with Neural Tensor Skip-Gram Model

Addressing the Polysemy Problem in Language Modeling with Attentional Multi-Sense Embeddings

Learning Context-Specific Word/Character Embeddings.

Constructing High Quality Sense-specific Corpus and Word Embedding Via Unsupervised Elimination of Pseudo Multi-sense.

xSense: Learning Sense-Separated Sparse Representations and Textual Definitions for Explainable Word Sense Networks

Chinese Word Sense Embedding with SememeWSD and Synonym Set

On Modeling Sense Relatedness in Multi-prototype Word Embedding.

Improved Learning of Chinese Word Embeddings with Semantic Knowledge.

An Exploration Of Semantic Relations In Neural Word Embeddings Using Extrinsic Knowledge

Real Multi-Sense or Pseudo Multi-Sense: an Approach to Improve Word Representation