Empirical Study of Zero-Shot NER with ChatGPT

Tingyu Xie,Qi Li,Jian Zhang,Yan Zhang,Zuozhu Liu,Hongwei Wang

2023-10-16

Abstract:Large language models (LLMs) exhibited powerful capability in various natural language processing tasks. This work focuses on exploring LLM performance on zero-shot information extraction, with a focus on the ChatGPT and named entity recognition (NER) task. Inspired by the remarkable reasoning capability of LLM on symbolic and arithmetic reasoning, we adapt the prevalent reasoning methods to NER and propose reasoning strategies tailored for NER. First, we explore a decomposed question-answering paradigm by breaking down the NER task into simpler subproblems by labels. Second, we propose syntactic augmentation to stimulate the model's intermediate thinking in two ways: syntactic prompting, which encourages the model to analyze the syntactic structure itself, and tool augmentation, which provides the model with the syntactic information generated by a parsing tool. Besides, we adapt self-consistency to NER by proposing a two-stage majority voting strategy, which first votes for the most consistent mentions, then the most consistent types. The proposed methods achieve remarkable improvements for zero-shot NER across seven benchmarks, including Chinese and English datasets, and on both domain-specific and general-domain scenarios. In addition, we present a comprehensive analysis of the error types with suggestions for optimization directions. We also verify the effectiveness of the proposed methods on the few-shot setting and other LLMs.

Computation and Language

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper primarily explores how to leverage large language models (LLMs) in zero-shot Named Entity Recognition (NER) tasks. Specifically, the research focuses on: 1. **Zero-shot Information Extraction**: Investigating the performance of LLMs in zero-shot information extraction tasks, particularly for named entity recognition. 2. **Inference Strategies**: Proposing a series of inference strategies to improve LLM performance in NER tasks, including Decomposed Question-Answering, Syntactic Augmentation, and Self-Consistency Voting. 3. **Experimental Validation**: Validating the effectiveness of the proposed strategies on 7 benchmark datasets and analyzing the impact of different methods on both English and Chinese datasets. Through this research, the authors aim to reveal the potential capabilities and optimization directions of LLMs in zero-shot scenarios.

Empirical Study of Zero-Shot NER with ChatGPT

ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT

Empirical Evaluation of ChatGPT on Requirements Information Retrieval Under Zero-Shot Setting

LLM-powered Zero-shot Online Log Parsing

Is ChatGPT a General-Purpose Natural Language Processing Task Solver?

Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models

Agent Instructs Large Language Models to be General Zero-Shot Reasoners

Revisiting Large Language Models as Zero-shot Relation Extractors

Rethinking Negative Instances for Generative Named Entity Recognition

Large Language Models are Zero-Shot Reasoners

ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

GPT-NER: Named Entity Recognition via Large Language Models

Large Language Models Are Zero-Shot Text Classifiers

llmNER: (Zero|Few)-Shot Named Entity Recognition, Exploiting the Power of Large Language Models

Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts

Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection Via Querying ChatGPT.

Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning

Zero-Shot Chain-of-Thought Reasoning Guided by Evolutionary Algorithms in Large Language Models

Learning from Language Description: Low-shot Named Entity Recognition via Decomposed Framework

Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective