Abstract: Named entity recognition (NER) aims to identify and classify specific entities mentioned in text. Most top-performing NER models follow the standard fully supervised paradigm, which requires a large amount of annotated data for training. To maintain performance when annotation resources are insufficient (i.e., in low-resource settings), in-context learning (ICL) has drawn considerable attention because of its plug-and-play nature compared with other methods (e.g., meta-learning and prompt learning). In this setting, retrieving demonstrations that are highly correlated with the target sentence is the key to eliciting ICL ability. For the NER task, correlation implies consistency of both ontology (i.e., generalized entity types) and context (i.e., sentence semantics), which previous NER demonstration retrieval techniques ignore. To address this issue, we propose ConsistNER, a novel three-stage framework that incorporates ontological and contextual information for low-resource NER. First, ConsistNER employs large language models (LLMs) to pre-recognize potential entities in a zero-shot manner. Second, ConsistNER retrieves sentence-specific demonstrations for each target sentence based on two considerations: (1) for ontological consistency, demonstrations are filtered into a candidate set based on their ontology distribution; (2) for contextual consistency, an entity-aware self-attention mechanism is introduced to focus more on the potential entities and semantically correlated tokens. Finally, ConsistNER feeds the retrieved demonstrations for each target sentence into LLMs for prediction. We conduct experiments on four widely adopted NER datasets covering both general and specific domains. Experimental results show that ConsistNER achieves improvements of 6.01%-26.37% and 3.07%-21.18% in Micro-F1 over the state-of-the-art baselines under 1-shot and 5-shot settings, respectively.
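The two-step retrieval described in the abstract (an ontological filter followed by contextual ranking) can be illustrated with a minimal sketch. This is not the authors' implementation, and it rests on several assumptions: the function names, the `pool` record layout, and the `n_candidates` and `k` parameters are hypothetical; the zero-shot LLM pre-recognition stage is stubbed as a precomputed list of entity types; and a plain bag-of-words cosine similarity stands in for the entity-aware self-attention mechanism, which the abstract does not specify in detail.

```python
from collections import Counter
from math import sqrt


def type_distribution(entity_types):
    """Turn a list of entity types into a relative-frequency distribution."""
    counts = Counter(entity_types)
    total = sum(counts.values())
    return {t: c / total for t, c in counts.items()} if total else {}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(key, 0.0) for key, v in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def bag_of_words(sentence):
    """Assumption: a bag-of-words vector replaces the paper's sentence encoder."""
    return dict(Counter(sentence.lower().split()))


def retrieve_demonstrations(target_sentence, target_types, pool,
                            n_candidates=3, k=1):
    """Pick k demonstrations for one target sentence.

    target_types: entity types pre-recognized for the target sentence
                  (stubbed here; the paper obtains them from a zero-shot LLM).
    pool: list of dicts with hypothetical keys "sentence" and "types".
    """
    # Step 1 (ontological consistency): keep the demonstrations whose
    # entity-type distribution is closest to the target's.
    target_dist = type_distribution(target_types)
    by_ontology = sorted(
        pool,
        key=lambda d: cosine(target_dist, type_distribution(d["types"])),
        reverse=True,
    )
    candidates = by_ontology[:n_candidates]

    # Step 2 (contextual consistency): rank the candidates by sentence
    # similarity to the target and return the top k.
    target_vec = bag_of_words(target_sentence)
    ranked = sorted(
        candidates,
        key=lambda d: cosine(target_vec, bag_of_words(d["sentence"])),
        reverse=True,
    )
    return ranked[:k]
```

In a full pipeline, the returned demonstrations would be formatted into the prompt for the final LLM prediction step; that formatting is outside the scope of this sketch.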

Enhancing Low-Resource NLP by Consistency Training with Data and Model Perturbations

ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation

Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages

ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context

Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning

Improving the Robustness of Large Language Models via Consistency Alignment

Handling Syntactic Divergence in Low-resource Machine Translation

Stable Consistency Tuning: Understanding and Improving Consistency Models

Effective Transfer Learning for Low-Resource Natural Language Understanding

3Rs: Data Augmentation Techniques Using Document Contexts For Low-Resource Chinese Named Entity Recognition

Enhancing Self-Consistency and Performance of Pre-Trained Language Models through Natural Language Inference

Unlocking the Potential of Model Merging for Low-Resource Languages

Towards better Chinese-centric neural machine translation for low-resource languages

Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with Minimal Resources

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models

A Dual-Contrastive Framework for Low-Resource Cross-Lingual Named Entity Recognition

Accurate, yet inconsistent? Consistency Analysis on Language Understanding Models

MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting

Improving Non-autoregressive Machine Translation with Error Exposure and Consistency Regularization

Building Low-Resource NER Models Using Non-Speaker Annotation

Improving Data Augmentation for Low-Resource NMT Guided by POS-Tagging and Paraphrase Embedding