Boosting LLMs with Ontology-Aware Prompt for NER Data Augmentation

Youchen Wang, Wenjun Ke, Zhizhao Luo, Peng Wang, Rui Qi, Yikai Guo
DOI: https://doi.org/10.1109/ICASSP48485.2024.10446860
2024-04-14
Abstract: Named Entity Recognition (NER) data augmentation (DA) aims to improve the performance and generalization capabilities of NER models by generating scalable training data. The key challenge lies in ensuring the generated samples maintain contextual diversity while preserving label consistency. However, existing dominant methods fail to simultaneously satisfy both criteria. Inspired by the extensive generative capabilities of large language models (LLMs), we propose ANGEL, a frAmework integrating the oNtoloGy structure and instructivE prompting within LLMs. Specifically, the hierarchical ontology structure guides prompt ranking, while instructive prompting enhances LLMs’ mastery of domain knowledge, empowering synthetic sample generation and annotation. Experiments show ANGEL surpasses state-of-the-art (SOTA) baselines, conferring absolute F1 increases of 2.86% and 0.93% on two benchmark datasets, respectively.
Computer Science