Abstract:Chinese Named Entity Recognition (NER) is a very important subtask in the information extraction domain. Its purpose is to locate named entities in the text and classify them into predetermined categories. The key point of NER is to learn a high-quality representation of tokens. Recently, representation learning techniques have been introduced into NER due to their excellent performance in mining the semantics of texts and mastering their organization. In the field of Chinese, many studies introduce multimodal feature extraction schemes to enrich token representations, such as radical and word. However, the learning scheme of these auxiliary features is relatively complex and has difficulty learning the interactive relationship between features with a concatenation or MLP fusion method. To address these challenges, a Multimodal Chinese NER Model based on Self-attention Mechanism named MNER is proposed, which consists of a multimodal feature fusion module and an entity classification module. To study the informative characterization of tokens, a multimodal feature fusion module is proposed to exploit radical, character, and word information. In the multimodal feature fusion module, a self-attention mechanism is designed to integrate multimodal features based on the correlation between the features, which addresses the problem that existing methods have difficulty exploiting interaction information between features. A semantic-aware category modifier is proposed to enhance the CRF classification layer's performance. It increases entity discrimination by adjusting the embeddings of features according to the similarity between the token embeddings and embedding of each entity category, which widens the encoding gap between different entities and narrows the search scope for entity classification. Finally, the proposed MNER is compared with ten state-of-the-art methods on Weibo and Resume datasets, and the results show the superiority of our model on three metrics.

A Novel Chinese Resume Named Entity Recognition Model Based on Lexical Enhancement.

A Local Information Perception Enhancement–Based Method for Chinese NER

Improving Chinese Named Entity Recognition Based on Lexical Information Adjustment

Enhanced Chinese Domain Named Entity Recognition: An Approach with Lexicon Boundary and Frequency Weight Features

Research of Chinese Resume Analysis Based on Feature Fusion

Chinese Named Entity Recognition Augmented with Lexicon Memory

A Novel Character-Word Fusion Chinese Named Entity Recognition Model Based on Attention Mechanism

A Chinese Named Entity Recognition System with Neural Networks

Pronounce Differently, Mean Differently: A Multi-Tagging-scheme Learning Method for Chinese NER Integrated with Lexicon and Phonetic Features

Using Chinese Glyphs for Named Entity Recognition

Exploiting Character-Word Fusion to Enhance Chinese Named Entity Recognition Combined with Multi-head Attention Mechanism

Multimodal Features Enhanced Named Entity Recognition Based on Self-Attention Mechanism.

Joint Self-Attention and Multi-Embeddings for Chinese Named Entity Recognition

Chinese Named Entity Recognition Fusing Lexical and Syntactic Information.

A Residual BiLSTM Model for Named Entity Recognition

Chinese Named Entity Recognition with a Multi-Phase Model

ELCA: Enhanced boundary location for Chinese named entity recognition via contextual association

Hierarchical LSTM with char-subword-word tree-structure representation for Chinese named entity recognition

A Chinese named entity recognition model incorporating recurrent cell and information state recursion

Hierarchical Contextualized Representation for Named Entity Recognition.

LSTM-CRF Neural Network with Gated Self Attention for Chinese NER