Abstract: Integrating lexical information into Chinese character embeddings is an effective way to address Chinese named entity recognition (NER). However, most existing methods focus only on discovering named entity boundaries and consider only the words matched by each Chinese character. They ignore both the association between a character and its left- and right-matching words and the local semantic information in the character's neighborhood, which is crucial for Chinese NER. Chinese contains a large number of polysemous words, that is, single words with multiple meanings. Without sufficient contextual information, the intended meaning of a text can therefore be hard to determine, which gives rise to ambiguity. We consider how to handle, more simply and effectively, the entity ambiguity that polysemous words cause in Chinese texts across different contexts. In this paper, we propose using graph attention networks to model the relations between matching words and neighboring characters, as well as among the matching words themselves, and to add left- and right-matching words directly, using the semantic information provided by the local lexicon. Moreover, we propose a short-sequence convolutional neural network (SSCNN), which uses the shorter subsequences generated and encoded by a sliding window module to enhance the perception of local information around each character. Compared with widely used Chinese NER models, our approach achieves improvements of 1.18%, 0.29%, 0.18%, and 1.1% on the four benchmark datasets Weibo, Resume, OntoNotes, and E-commerce, respectively, demonstrating the effectiveness of the model.
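
To make the two local-information ideas in the abstract more concrete, here is a minimal Python sketch. It is not the authors' implementation. It makes two assumptions the abstract does not state: a "left-matching word" of a character is taken to be a lexicon word ending at that character, and a "right-matching word" is one starting at it. The function names `sliding_subsequences` and `match_words`, the `<pad>` token, the window size, and the toy lexicon are all hypothetical.

```python
from typing import List, Set, Tuple


def sliding_subsequences(chars: str, window: int = 3) -> List[List[str]]:
    """Return one short subsequence per character, centred on that character.

    Assumption: an odd window size with symmetric "<pad>" padding at both ends.
    """
    pad = window // 2
    padded = ["<pad>"] * pad + list(chars) + ["<pad>"] * pad
    return [padded[i:i + window] for i in range(len(chars))]


def match_words(
    text: str, lexicon: Set[str], max_len: int = 4
) -> Tuple[List[List[str]], List[List[str]]]:
    """Collect lexicon words that end at (left) or start at (right) each character.

    Assumption: this reading of "left/right matching words" is illustrative only.
    """
    left: List[List[str]] = [[] for _ in text]
    right: List[List[str]] = [[] for _ in text]
    for start in range(len(text)):
        # Only multi-character words (length 2..max_len) are considered.
        for end in range(start + 2, min(len(text), start + max_len) + 1):
            word = text[start:end]
            if word in lexicon:
                right[start].append(word)   # word begins at this character
                left[end - 1].append(word)  # word ends at this character
    return left, right


if __name__ == "__main__":
    sentence = "南京市长江大桥"
    toy_lexicon = {"南京", "南京市", "市长", "长江", "大桥", "长江大桥"}
    left, right = match_words(sentence, toy_lexicon)
    for ch, sub, l, r in zip(sentence, sliding_subsequences(sentence), left, right):
        print(ch, sub, "left:", l, "right:", r)
```

In this toy sentence the character "长" is matched by both "市长" and "长江", which is the kind of ambiguity that the surrounding characters and matching words are meant to help resolve.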

Incorporating Part of Speech Information in Span Representation for Named Entity Recognition

SpanNER: Named Entity Re-/Recognition As Span Prediction

S-NER: A Concise and Efficient Span-Based Model for Named Entity Recognition

Deep Span Representations for Named Entity Recognition

A segment enhanced span-based model for nested named entity recognition

Span-Based Nested Named Entity Recognition with Pretrained Language Model

SpanProto: A Two-stage Span-based Prototypical Network for Few-shot Named Entity Recognition

Handling Negative Samples Problems in Span-Based Nested Named Entity Recognition

Full-span named entity recognition with boundary regression

Bi-directional context-aware network for the nested named entity recognition

Joint Learning of Token Context and Span Feature for Span-Based Nested NER

Enhanced Language Representation with Label Knowledge for Span Extraction

Span-based joint entity and relation extraction augmented with sequence tagging mechanism

Boosting Span-based Joint Entity and Relation Extraction via Squence Tagging Mechanism

A simple but effective span-level tagging method for discontinuous named entity recognition

A Boundary Offset Prediction Network for Named Entity Recognition

Global Span Semantic Dependency Awareness and Filtering Network for nested named entity recognition

Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition

CLESR: Context-Based Label Knowledge Enhanced Span Recognition for Named Entity Recognition

A Local Information Perception Enhancement–Based Method for Chinese NER

Win-Win Cooperation: Bundling Sequence and Span Models for Named Entity Recognition