Double-Checker: Large Language Model As a Checker for Few-shot Named Entity Recognition

Wei Chen,Lili Zhao,Zhi Zheng,Tong Xu,Yang Wang,Enhong Chen
DOI: https://doi.org/10.18653/v1/2024.findings-emnlp.180
2024-01-01
Abstract:Recently, few-shot Named Entity Recognition (NER) has attracted significant attention due to the high cost of obtaining high-quality labeled data. Decomposition-based methods have demonstrated remarkable performance on this task, which initially train a type-independent span detector and subsequently classify the detected spans based on their types. However, this framework has an evident drawback as a domain-agnostic detector cannot ensure the identification of only those entity spans that are specific to the target domain. To address this issue, we propose Double-Checker, which leverages collaboration between Large Language Models (LLMs) and small models. Specifically, we employ LLMs to verify candidate spans predicted by the small model and eliminate any spans that fall outside the scope of the target domain. Extensive experiments validate the effectiveness of our method, consistently yielding improvements over two baseline approaches. Our code is available at https://github.com/fanshu6hao/Double-Checker.
What problem does this paper attempt to address?