Abstract: With the rapid development of the Internet, the way people obtain information has changed dramatically. In recent years, the knowledge graph has become a popular tool for the public to acquire knowledge. To build knowledge graphs of Chinese history and culture, most researchers have used traditional named entity recognition methods to extract entity information from unstructured historical text. However, these traditional methods have shortcomings, and they tend to ignore the associations between entities. To extract entities from large amounts of historical and cultural text more accurately and efficiently, this paper proposes a named entity recognition model that combines Bidirectional Encoder Representations from Transformers with a Bidirectional Long Short-Term Memory network and a Conditional Random Field (BERT-BiLSTM-CRF). First, a BERT pre-trained language model encodes each character to obtain its vector representation. Then, a Bidirectional Long Short-Term Memory (BiLSTM) layer semantically encodes the input text. Finally, the Conditional Random Field (CRF) layer outputs the label with the highest probability, which gives the category of each character. The model uses the BERT pre-trained language model in place of static word vectors trained in the traditional way. Unlike static vectors, BERT generates semantic vectors dynamically according to the context of each word, which improves the representation ability of the word vectors. The experimental results show that the proposed model achieves strong results on named entity recognition in the field of history and culture. Compared with existing named entity recognition methods, precision, recall, and the $F_1$ value are all significantly improved.
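
The final step of the pipeline described above, in which the CRF layer selects the most probable label sequence, can be illustrated with a small decoding sketch. The abstract does not say how decoding is carried out; Viterbi decoding is assumed here because it is the usual choice for linear-chain CRFs. The function name `viterbi_decode`, the three-label tag set, and every score below are hypothetical values chosen for illustration, not the paper's implementation or data. The emission scores stand in for per-character outputs of the BiLSTM layer.

```python
from typing import List


def viterbi_decode(
    emissions: List[List[float]],
    transitions: List[List[float]],
    labels: List[str],
) -> List[str]:
    """Return the highest-scoring label sequence for one sentence.

    emissions[t][y]   : score of label y for character t (assumed BiLSTM output)
    transitions[p][y] : score of moving from label p to label y (CRF parameter)
    """
    n = len(labels)
    score = list(emissions[0])  # best path score ending in each label at t = 0
    backptr = []                # backptr[t-1][y] = best previous label for y at t

    for t in range(1, len(emissions)):
        new_score, ptrs = [], []
        for y in range(n):
            # Pick the previous label that maximises the path score into y.
            best_prev = max(range(n), key=lambda p: score[p] + transitions[p][y])
            new_score.append(
                score[best_prev] + transitions[best_prev][y] + emissions[t][y]
            )
            ptrs.append(best_prev)
        score = new_score
        backptr.append(ptrs)

    # Trace the best path backwards from the best final label.
    path = [max(range(n), key=lambda y: score[y])]
    for ptrs in reversed(backptr):
        path.append(ptrs[path[-1]])
    path.reverse()
    return [labels[i] for i in path]


# Hypothetical tag set and scores for a three-character input.
LABELS = ["O", "B-PER", "I-PER"]
EMISSIONS = [
    [0.1, 2.0, 0.5],  # character 1: B-PER scores highest
    [1.0, 0.2, 0.9],  # character 2: O narrowly beats I-PER on emissions alone
    [2.0, 0.1, 0.3],  # character 3: O scores highest
]
TRANSITIONS = [
    [0.5, 0.0, -10.0],  # from O: an I-PER directly after O is heavily penalised
    [0.0, -1.0, 1.0],   # from B-PER: continuing with I-PER is rewarded
    [0.0, -1.0, 0.5],   # from I-PER
]

print(viterbi_decode(EMISSIONS, TRANSITIONS, LABELS))
# ['B-PER', 'I-PER', 'O']
```

Picking the best label for each character independently would tag the second character as `O`. The transition scores change that decision to `I-PER`, which is the kind of dependency between neighbouring labels that a CRF layer is meant to capture.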
