Abstract:Named entity recognition (NER) is to identify and categorize entities in unstructured text, which serves as a fundamental task for a variety of natural language processing (NLP) applications. In particular, emerging few-shot NER methods aim to learn model parameters well with few samples and have received considerable attention. The dominant few-shot NER methods usually employ pre-trained language models (PLMs) as their basic architecture and fine-tune model parameters with few NER samples. Since the sample size is small and there are a large number of parameters in PLMs, fine-tuning may result in the parameters of PLMs being highly biased. To address this issue, this study introduces the semantic distribution distance constraints to optimize the fine-tuning process of few-shot NER models and develops a framework named Semantic Constraints on few-shot Named Entity Recognition (SCNER). Specifically, the framework formulates the general knowledge transfer of PLMs as an optimal transport procedure with a semantic prior. And, a Semantics-induced Optimal Transport (SOT) regularizer is developed to utilize the importance and similarities of tokens within sentences. SOT builds the semantic distribution of the sentence and defines the transport costs between tokens to achieve the token-level optimal transport procedures. Finally, SOT is employed as a regularization term of few-shot NER to introduce the semantic distribution distance constraint for effectively transferring general knowledge from PLMs. The experiments on four public datasets demonstrate that the proposed method significantly improves the performance of NER models in both few-shot and fully supervised scenarios. SCNER is a common framework that can be applied to a variety of models without adding additional learning parameters, and can be used to enhance the generalization ability and adaptability of various few-shot NER models.

Logit Adjustment with Normalization and Augmentation in Few-Shot Named Entity Recognition

CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition

BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition

A Unified Label-Aware Contrastive Learning Framework for Few-Shot Named Entity Recognition

Coarse-to-fine Few-shot Learning for Named Entity Recognition

Fighting Against the Repetitive Training and Sample Dependency Problem in Few-shot Named Entity Recognition

CLINER: exploring task-relevant features and label semantic for few-shot named entity recognition

Few-Shot Named Entity Recognition Via Meta-Learning (extended Abstract).

LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition

Local Spatial Alignment Network for Few-Shot Learning

Wide & Deep Learning for improving Named Entity Recognition via Text-Aware Named Entity Normalization

Named Entity Recognition Via Noise Aware Training Mechanism with Data Filter.

Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition

Meta-Learning Triplet Network with Adaptive Margins for Few-Shot Named Entity Recognition

Few-shot named entity recognition with hybrid multi-prototype learning

Hybrid Multi-stage Decoding for Few-shot NER with Entity-aware Contrastive Learning

Improving few-shot named entity recognition via Semantics induced Optimal Transport

Causal Interventions-based Few-Shot Named Entity Recognition

Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets