Abstract: Medical named entity recognition (NER) is a critical task in medical text processing. However, medical documents vary widely in language usage, abbreviations, synonyms, misspellings, and typographical errors, which makes the precise extraction of named entities challenging. Although large language models (LLMs) perform well on medical knowledge extraction tasks in few-shot settings, their capabilities are difficult to fully exploit in supervised medical NER tasks. This is because NER is a sequence labeling task, whereas LLMs are better suited to tasks such as text generation. Furthermore, performance is lost when LLMs must express the structured output of NER as generated text. Using LLMs to improve the accuracy of medical NER is therefore a challenging problem. In this paper, we propose a method that integrates LLM knowledge to enhance the performance of medical NER models. First, we modify the structure of the LLM to make it more adaptable to NER tasks. Second, we adopt the LoRA method and incorporate Chinese vocabulary information into model training. Finally, to fully utilize the fine-tuned LLM to enhance the medical NER model, we convert the output of the LLM into a knowledge concentration matrix and inject it into the NER model. We verify the effectiveness of our method on the CMeEE dataset. The results demonstrate that our method can efficiently fine-tune the LLM and improve its performance. Moreover, our method can leverage the prior knowledge of the fine-tuned LLM to enhance a BERT-based medical NER model. In addition, our method generalizes well and can tackle entity recognition tasks in other domains, which we validate on the resume-zh dataset.
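The mismatch between sequence labeling and text generation noted in the abstract can be illustrated with a small sketch. This is not the method proposed in the paper. The sentence, the tag names (`SYM`, `DIS`), and the helper functions `bio_to_spans` and `spans_to_text` are all hypothetical and chosen only for illustration. The sketch shows that a tagger emits one label per token, while a generative model has to serialize the same entities as free text, which discards the token positions.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collect (entity_text, entity_type) pairs from a BIO-tagged sequence."""
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity starts; close any entity that is still open.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_tokens and tag[2:] == current_type:
            # Continue the currently open entity.
            current_tokens.append(token)
        else:
            # "O" tag or an inconsistent "I-" tag: close any open entity.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [], None
    if current_tokens:
        spans.append((" ".join(current_tokens), current_type))
    return spans


def spans_to_text(spans: List[Tuple[str, str]]) -> str:
    """Serialize entities as a flat string, as a generative model would emit them."""
    return "; ".join(f"{text} -> {etype}" for text, etype in spans)


# Hypothetical example sentence with one tag per token (sequence labeling view).
tokens = ["Patient", "reports", "chest", "pain", "and", "type", "2", "diabetes"]
tags = ["O", "O", "B-SYM", "I-SYM", "O", "B-DIS", "I-DIS", "I-DIS"]

spans = bio_to_spans(tokens, tags)
generated = spans_to_text(spans)  # generative view: positions are no longer explicit
print(spans)
print(generated)
```

In the tagged view every token keeps its position and label, so evaluation against gold spans is direct. In the serialized view, the entity text must be matched back to the sentence, which is one plausible source of the performance loss the abstract describes.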

Active Learning for Name Entity Recognition with External Knowledge

Improving Biomedical Named Entity Recognition with a Unified Multi-Task MRC Framework

A Multi-task Approach for Machine Reading Comprehension Form Named Entity Recognition Tasks

Named Entity Recognition via Machine Reading Comprehension: A Multi-Task Learning Approach

Learning to Label with Active Learning and Reinforcement Learning

MetaNER: Named Entity Recognition with Meta-Learning

Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition

Complex Named Entity Recognition via Deep Multi-task Learning from Scratch

Multi-Grained Knowledge Distillation for Named Entity Recognition

Towards Malay named entity recognition: an open-source dataset and a multi-task framework

Multi-task learning for Chinese clinical named entity recognition with external knowledge

Wide & Deep Learning for improving Named Entity Recognition via Text-Aware Named Entity Normalization

An Open-Source Dataset and A Multi-Task Model for Malay Named Entity Recognition

SCANNER: Knowledge-Enhanced Approach for Robust Multi-modal Named Entity Recognition of Unseen Entities

Multi-task Transformer with Relation-attention and Type-attention for Named Entity Recognition

A Unified MRC Framework for Named Entity Recognition

Multi-Task Learning with Contextualized Word Representations for Extented Named Entity Recognition

A Knowledge-Enhanced Medical Named Entity Recognition Method that Integrates Pre-Trained Language Models

Combining self learning and active learning for Chinese Named Entity Recognition

Pretraining Multi-modal Representations for Chinese NER Task with Cross-Modality Attention

Named entity recognition model based on Multi‐BiLSTM and competition mechanism