Abstract:Integrating lexical information into Chinese character embedding is a valid method to figure out the Chinese named entity recognition (NER) issue. However, most existing methods focus only on the discovery of named entity boundaries, considering only the words matched by the Chinese characters. They ignore the association between Chinese characters and their left and right matching words. They ignore the local semantic information of the character’s neighborhood, which is crucial for Chinese NER. The Chinese language incorporates a significant number of polysemous words, meaning that a single word can possess multiple meanings. Consequently, in the absence of sufficient contextual information, individuals may encounter difficulties in comprehending the intended meaning of a text, leading to the emergence of ambiguity. We consider how to handle the issue of entity ambiguity because of polysemous words in Chinese texts in different contexts more simply and effectively. We propose in this paper the use of graph attention networks to construct relatives among matching words and neighboring characters as well as matching words and adding left- and right-matching words directly using semantic information provided by the local lexicon. Moreover, this paper proposes a short-sequence convolutional neural network (SSCNN). It utilizes the generated shorter subsequence encoded with the sliding window module to enhance the perception of local information about the character. Compared with the widely used Chinese NER models, our approach achieves 1.18%, 0.29%, 0.18%, and 1.1% improvement on the four benchmark datasets Weibo, Resume, OntoNotes, and E-commerce, respectively, and proves the effectiveness of the model.

LSA-Based Chinese-Slavic Mongolian NER Disambiguation.

Multi-document Chinese Name Disambiguation Based on Latent Semantic Analysis

Entity disambiguation with context awareness in user-generated short texts

A Local Information Perception Enhancement–Based Method for Chinese NER

Scholar Name Disambiguation with Search-enhanced LLM Across Language

Entity Disambiguation via Fusion Entity Decoding

Using Lexical and Thematic Knowledge for Name Disambiguation.

Chinese Named Entity Recognition and Disambiguation Based on Multi-stage Clustering

Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP

Author Name Disambiguation via Heterogeneous Network Embedding from Structural and Semantic Perspectives

Leveraging Deep Neural Networks and Knowledge Graphs for Entity Disambiguation

Learning Entity Representation for Named Entity Disambiguation.

ELCA: Enhanced boundary location for Chinese named entity recognition via contextual association

SEN: A Subword-Based Ensemble Network for Chinese Historical Entity Extraction

Famous names: The Esing Bakery, Hong Kong.

Towards Effective Disambiguation for Machine Translation with Large Language Models

ADANA: Active Name Disambiguation

Linking Entities in Short Texts Based on a Chinese Semantic Knowledge Base

Research on the Application of a Chinese Semantic Knowledge Base in Chinese Phrase Disambiguation

Entity Disambiguation with Freebase.

Name Disambiguation Using Web Connection