LinkNER: Linking Local Named Entity Recognition Models to Large Language Models using Uncertainty

Zhen Zhang,Yuhua Zhao,Hang Gao,Mengting Hu
2024-02-27
Abstract:Named Entity Recognition (NER) serves as a fundamental task in natural language understanding, bearing direct implications for web content analysis, search engines, and information retrieval systems. Fine-tuned NER models exhibit satisfactory performance on standard NER benchmarks. However, due to limited fine-tuning data and lack of knowledge, it performs poorly on unseen entity recognition. As a result, the usability and reliability of NER models in web-related applications are compromised. Instead, Large Language Models (LLMs) like GPT-4 possess extensive external knowledge, but research indicates that they lack specialty for NER tasks. Furthermore, non-public and large-scale weights make tuning LLMs difficult. To address these challenges, we propose a framework that combines small fine-tuned models with LLMs (LinkNER) and an uncertainty-based linking strategy called RDC that enables fine-tuned models to complement black-box LLMs, achieving better performance. We experiment with both standard NER test sets and noisy social media datasets. LinkNER enhances NER task performance, notably surpassing SOTA models in robustness tests. We also quantitatively analyze the influence of key components like uncertainty estimation methods, LLMs, and in-context learning on diverse NER tasks, offering specific web-related recommendations.
Computation and Language
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on the challenges encountered in the Named Entity Recognition (NER) task, especially the poor performance when dealing with unseen entities (Out - of - Vocabulary, OOV and Out - of - Domain, OOD entities). Specifically: 1. **Limitations of existing NER models**: Although the existing fine - tuned NER models perform well in standard NER benchmark tests, due to the limited fine - tuning data and lack of knowledge, these models have poor performance in recognizing unseen entities. This affects the usability and reliability of NER models in web - related applications. 2. **Advantages and disadvantages of large - language models (LLMs)**: Although large - language models such as GPT - 4 have extensive knowledge, research shows that they lack professionalism in NER tasks. In addition, due to non - public and large - scale weights, it is difficult to adjust LLMs. To address these challenges, the paper proposes a new framework - LinkNER, which combines small fine - tuned models and large - language models and introduces an uncertainty - based linking strategy (called RDC), enabling the fine - tuned model to supplement black - box LLMs, thereby achieving better performance. Through experiments, LinkNER performs excellently on both standard NER test sets and noisy social media data sets, especially significantly outperforming the current state - of - the - art (SOTA) models in robustness tests. In addition, the paper also quantitatively analyzes the impact of uncertainty estimation methods, LLMs, and context learning on different NER tasks and provides specific web - related suggestions.