Abstract:Named entity recognition (NER) is to identify and categorize entities in unstructured text, which serves as a fundamental task for a variety of natural language processing (NLP) applications. In particular, emerging few-shot NER methods aim to learn model parameters well with few samples and have received considerable attention. The dominant few-shot NER methods usually employ pre-trained language models (PLMs) as their basic architecture and fine-tune model parameters with few NER samples. Since the sample size is small and there are a large number of parameters in PLMs, fine-tuning may result in the parameters of PLMs being highly biased. To address this issue, this study introduces the semantic distribution distance constraints to optimize the fine-tuning process of few-shot NER models and develops a framework named Semantic Constraints on few-shot Named Entity Recognition (SCNER). Specifically, the framework formulates the general knowledge transfer of PLMs as an optimal transport procedure with a semantic prior. And, a Semantics-induced Optimal Transport (SOT) regularizer is developed to utilize the importance and similarities of tokens within sentences. SOT builds the semantic distribution of the sentence and defines the transport costs between tokens to achieve the token-level optimal transport procedures. Finally, SOT is employed as a regularization term of few-shot NER to introduce the semantic distribution distance constraint for effectively transferring general knowledge from PLMs. The experiments on four public datasets demonstrate that the proposed method significantly improves the performance of NER models in both few-shot and fully supervised scenarios. SCNER is a common framework that can be applied to a variety of models without adding additional learning parameters, and can be used to enhance the generalization ability and adaptability of various few-shot NER models.

Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings

A Report on the Euphemisms Detection Shared Task

EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation

Euphemistic Phrase Detection by Masked Language Model

MEDs for PETs: Multilingual Euphemism Disambiguation for Potentially Euphemistic Terms

TEDB System Description to a Shared Task on Euphemism Detection 2022

Impromptu Cybercrime Euphemism Detection

Turkish Delights: a Dataset on Turkish Euphemisms

FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms

Empirical Study of Zero-Shot NER with ChatGPT

CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms

Zero and Few-shot Semantic Parsing with Ambiguous Inputs

Zero-Shot Stance Detection: A Dataset and Model using Generalized Topic Representations

A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction

A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters

Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings

Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models

Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis

Improving few-shot named entity recognition via Semantics induced Optimal Transport

drsphelps at SemEval-2022 Task 2: Learning idiom representations using BERTRAM

GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting