Abstract:The Chinese named entity recognition (NER) task is a sub-task within the information extraction domain, where the task goal is to find, identify and classify relevant entities, such as names of people, places and organizations, from sentences given a piece of unstructured text. Chinese named entity recognition is a fundamental task in the field of natural language processing (NLP) and plays an important role in many downstream NLP tasks, including information retrieval, relationship extraction and question and answer systems. This paper provides a comprehensive review of existing neural network-based word-character lattice structures for Chinese NER models. Firstly, this paper introduces that Chinese NER is more difficult than English NER, and there are difficulties and challenges such as difficulty in determining the boundaries of Chinese text-related entities and complex Chinese grammatical structures. Secondly, this paper investigates the most representative lattice-structured Chinese NER models under different neural network architectures (RNN (recurrent neural network), CNN (convolutional neural network), GNN (graph neural network) and Transformer). Since word sequence information can capture more boundary information for character-based sequence learning, in order to explicitly exploit the lexical information associated with each character, some prior work has proposed integrating word information into character sequences via word-character lattice structures. These neural network-based word-character lattice structures perform significantly better than word-based or character-based approaches on the Chinese NER task. Finally, this paper introduces the dataset and evaluation criteria of Chinese NER.

Chinese Named Entity Recognition Using a Morpheme-Based Chunking Tagger

Chinese word segmentation as morpheme-based lexical chunking

Pronounce Differently, Mean Differently: A Multi-Tagging-scheme Learning Method for Chinese NER Integrated with Lexicon and Phonetic Features

Tagging Schemes Can Do More in Named Entity Recognition: Take Chinese As an Example

A Local Information Perception Enhancement–Based Method for Chinese NER

Using Chinese Glyphs for Named Entity Recognition

Chinese Named Entity Recognition Augmented with Lexicon Memory

A Chinese Named Entity Recognition System with Neural Networks

Chinese named entity recognition with a hybrid-statistical model

Chinese Named Entity Recognition with Character-Word Mixed Embedding.

CHINERS

Chinese Text Chunking Using Lexicalized HMMs

Exploiting Character-Word Fusion to Enhance Chinese Named Entity Recognition Combined with Multi-head Attention Mechanism

Survey of Chinese Named Entity Recognition

MFE-NER: Multi-feature Fusion Embedding for Chinese Named Entity Recognition

A More Efficient Chinese Named Entity Recognition base on BERT and Syntactic Analysis

Correcting Word Segmentation and Part-of-Speech Tagging Errors for Chinese Named Entity Recognition

Improving Chinese Named Entity Recognition by Large-Scale Syntactic Dependency Graph

Hierarchical Lexicon Embedding Architecture for Chinese Named Entity Recognition

Chinese Named Entity Recognition Fusing Lexical and Syntactic Information.

Enhanced Chinese Domain Named Entity Recognition: An Approach with Lexicon Boundary and Frequency Weight Features