Incorporating Large Language Models into Named Entity Recognition: Opportunities and Challenges
Youchen Wang,Zhizhao Luo,Zhi Fang,Yuting Yang,Yikai Guo,Rui Qi
DOI: https://doi.org/10.1109/ICCBD-AI62252.2023.00079
2023-12-15
Abstract:Named entity recognition (NER) is a critical task in natural language processing, involving the identification and classification of entities in text. Traditional approaches to NER, such as rule-based methods and supervised machine learning, faced limitations in adapting to diverse domains and languages due to the need for large labeled datasets. Large language models (LLMs) such as GPT3, which undergo vast pretraining, demonstrate the promising ability to substantially improve NER beyond supervised baselines as evidenced by over a 100% increase in F1 score on domain-specific evaluation corpora.This survey comprehensively examines the pivotal roles and methodologies of LLMs in NER. It provides a thorough analysis of two key techniques: in-context learning and fine-tuning. In-context learning enables quick NER task resolution without major model changes, while fine-tuning offers significant improvements, especially in domain-specific applications. The survey also covers essential resources, including NER corpora, LLM types, and methods of fine-tuning. It addresses challenges related to computational resources and model size. The survey concludes by outlining potential future research directions, emphasizing LLMs’ contributions to advancing NER technologies.
Linguistics,Computer Science