Abstract:Integrating lexical information into Chinese character embedding is a valid method to figure out the Chinese named entity recognition (NER) issue. However, most existing methods focus only on the discovery of named entity boundaries, considering only the words matched by the Chinese characters. They ignore the association between Chinese characters and their left and right matching words. They ignore the local semantic information of the character’s neighborhood, which is crucial for Chinese NER. The Chinese language incorporates a significant number of polysemous words, meaning that a single word can possess multiple meanings. Consequently, in the absence of sufficient contextual information, individuals may encounter difficulties in comprehending the intended meaning of a text, leading to the emergence of ambiguity. We consider how to handle the issue of entity ambiguity because of polysemous words in Chinese texts in different contexts more simply and effectively. We propose in this paper the use of graph attention networks to construct relatives among matching words and neighboring characters as well as matching words and adding left- and right-matching words directly using semantic information provided by the local lexicon. Moreover, this paper proposes a short-sequence convolutional neural network (SSCNN). It utilizes the generated shorter subsequence encoded with the sliding window module to enhance the perception of local information about the character. Compared with the widely used Chinese NER models, our approach achieves 1.18%, 0.29%, 0.18%, and 1.1% improvement on the four benchmark datasets Weibo, Resume, OntoNotes, and E-commerce, respectively, and proves the effectiveness of the model.

Effective Bilingual Constraints for Semi-Supervised Learning of Named Entity Recognizers.

A Local Information Perception Enhancement–Based Method for Chinese NER

Dependency syntax guided BERT-BiLSTM-GAM-CRF for Chinese NER

A Unified Model for Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media

Pronounce Differently, Mean Differently: A Multi-Tagging-scheme Learning Method for Chinese NER Integrated with Lexicon and Phonetic Features

Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model.

Enhanced Chinese Domain Named Entity Recognition: An Approach with Lexicon Boundary and Frequency Weight Features

Mutual Reinforcement Effects in Japanese Sentence Classification and Named Entity Recognition Tasks

Win-Win Cooperation: Bundling Sequence and Span Models for Named Entity Recognition

Towards Lingua Franca Named Entity Recognition with BERT

Semantic Role Labeling Integrated with Multilevel Linguistic Cues and Bi-LSTM-CRF

Using Chinese Glyphs for Named Entity Recognition

Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with Minimal Resources.

A More Efficient Chinese Named Entity Recognition base on BERT and Syntactic Analysis

SEN: A Subword-Based Ensemble Network for Chinese Historical Entity Extraction

Chinese Named Entity Recognition with the Improved Smoothed Conditional Random Fields

Mix of Experts Language Model for Named Entity Recognition

Improving Chinese SRL with Heterogeneous Annotations

Chinese Named Entity Recognition Method Combining ALBERT and a Local Adversarial Training and Adding Attention Mechanism

Cross-Lingual Named Entity Recognition Based on Attention and Adversarial Training

A Double Adversarial Network Model for Multi-Domain and Multi-Task Chinese Named Entity Recognition