Abstract: The task of named entity recognition (NER) is normally divided into nested NER and flat NER depending on whether named entities are nested or not. Models are usually separately developed for the two tasks, since sequence labeling models, the most widely used backbone for flat NER, are only able to assign a single label to a particular token, which is unsuitable for nested NER where a token may be assigned several labels. In this paper, we propose a unified framework that is capable of handling both flat and nested NER tasks. Instead of treating the task of NER as a sequence labeling problem, we propose to formulate it as a machine reading comprehension (MRC) task. For example, extracting entities with the \textsc{per} label is formalized as extracting answer spans to the question "{\it which person is mentioned in the text?}". This formulation naturally tackles the entity overlapping issue in nested NER: the extraction of two overlapping entities for different categories requires answering two independent questions. Additionally, since the query encodes informative prior knowledge, this strategy facilitates the process of entity extraction, leading to better performances for not only nested NER, but flat NER. We conduct experiments on both {\em nested} and {\em flat} NER datasets. Experimental results demonstrate the effectiveness of the proposed formulation. We are able to achieve vast amount of performance boost over current SOTA models on nested NER datasets, i.e., +1.28, +2.55, +5.44, +6.37, respectively on ACE04, ACE05, GENIA and KBP17, along with SOTA results on flat NER datasets, i.e.,+0.24, +1.95, +0.21, +1.49 respectively on English CoNLL 2003, English OntoNotes 5.0, Chinese MSRA, Chinese OntoNotes 4.0.

On the Strength of Character Language Models for Multilingual Named Entity Recognition

mCL-NER: Cross-Lingual Named Entity Recognition via Multi-view Contrastive Learning

Incorporating Large Language Models into Named Entity Recognition: Opportunities and Challenges

Chinese Named Entity Recognition with a Multi-Phase Model

A Local Information Perception Enhancement–Based Method for Chinese NER

Large Language Models Struggle in Token-Level Clinical Named Entity Recognition

What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis

Multicultural Name Recognition For Previously Unseen Names

Better Character Language Modeling Through Morphology

CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition

GEIC: Universal and Multilingual Named Entity Recognition with Large Language Models

Empirical Study on Character Level Neural Network Classifier for Chinese Text.

LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking

Pronounce Differently, Mean Differently: A Multi-Tagging-scheme Learning Method for Chinese NER Integrated with Lexicon and Phonetic Features

Using Chinese Glyphs for Named Entity Recognition

A Chinese named entity recognition model: integrating label knowledge and lexicon information

Adversarial Multi-Task Learning for Efficient Chinese Named Entity Recognition

Chinese Clinical Named Entity Recognition with ALBERT and MHA Mechanism

Neural Named Entity Recognition from Subword Units

A Unified MRC Framework for Named Entity Recognition

Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention Mechanism.