Injecting Categorical Labels and Syntactic Information into Biomedical NER

Sumam Francis, Marie-Francine Moens
2023-11-06
Abstract: We present a simple approach to improve biomedical named entity recognition (NER) by injecting categorical labels and part-of-speech (POS) information into the model. We explore two approaches. In the first, we train a sequence-level classifier that assigns each sentence to a category, yielding sentence-level tags (categorical labels). The sequence classifier is framed as an entailment problem by rewriting the labels as natural language templates, which improves the classifier's accuracy. This label information is then injected into the NER model, and we demonstrate effective ways to represent and inject these labels and POS attributes. In the second approach, we jointly learn the categorical labels and the NER labels, and again inject the POS tags to give the model more syntactic context. Experiments on three benchmark datasets show that incorporating categorical label information together with syntactic context is useful and outperforms baseline BERT-based models.
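The entailment framing of the sentence classifier can be illustrated with a minimal sketch. The template wording, the category names, and the helper `build_entailment_pairs` are assumptions made for illustration only; the abstract does not specify them. Each candidate label is rewritten as a natural language hypothesis and paired with the sentence, so that a standard entailment classifier could score every pair.

```python
from typing import List, Tuple

# Hypothetical template and category names, chosen only for illustration.
TEMPLATE = "This sentence mentions a {label}."


def build_entailment_pairs(
    sentence: str, labels: List[str], gold: str
) -> List[Tuple[str, str, int]]:
    """Turn one labeled sentence into (premise, hypothesis, target) triples.

    target is 1 when the hypothesis was built from the gold label, else 0.
    """
    pairs = []
    for label in labels:
        # Rewrite the bare label as a natural language hypothesis.
        hypothesis = TEMPLATE.format(label=label)
        pairs.append((sentence, hypothesis, int(label == gold)))
    return pairs


pairs = build_entailment_pairs(
    "Mutations in BRCA1 increase the risk of breast cancer.",
    ["disease", "gene", "chemical"],
    gold="disease",
)
for premise, hypothesis, target in pairs:
    print(target, "|", hypothesis)
```

At prediction time, one would presumably score each hypothesis against the sentence and keep the label whose hypothesis receives the highest entailment score.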
Computation and Language, Information Retrieval, Machine Learning
What problem does this paper attempt to address?
The paper primarily addresses the task of Named Entity Recognition (NER) in the biomedical field and proposes two methods to improve model performance: a pipeline method and a joint training method. Specifically, the paper attempts to solve the following problems:

1. **How to utilize additional information (such as category labels and syntactic information) to improve the NER task in biomedical texts?**
   - The paper enhances model performance by injecting sentence-level category labels and part-of-speech (POS) information into the model.
2. **Exploring different ways to inject this additional information into the model to achieve optimal performance improvement.**
   - Various strategies were studied, including injecting attribute information into word embeddings, text representations, attention mechanisms, and named entity classifiers.
3. **Evaluating whether the proposed methods can significantly improve the accuracy of named entity recognition.**
   - Experiments were conducted on three standard datasets (BC2GM, BC5CDR-disease, and NCBI-disease) and compared with baseline models.

The main contribution of the paper is demonstrating how to effectively utilize category labels and syntactic information to enhance the performance of biomedical NER models. Experiments show that the combined use of category labels and POS tags can effectively improve the model's accuracy in entity recognition, especially when dealing with complex entities. Additionally, the paper explores the performance differences between the methods (pipeline and joint training) in various scenarios.
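As a rough illustration of one of the injection strategies listed above (injection at the level of the text representation), the sketch below prepends a sentence-category marker to the token sequence and places a POS marker after each word. The marker formats, the `IGNORE` placeholder, and the function `inject_label_and_pos` are hypothetical choices, not the paper's implementation; the other strategies (word embeddings, attention, classifier) are not shown.

```python
from typing import List, Tuple

# Hypothetical marker for positions a training loop would exclude from the NER loss.
IGNORE = "IGNORE"


def inject_label_and_pos(
    tokens: List[str],
    pos_tags: List[str],
    ner_tags: List[str],
    sentence_label: str,
) -> Tuple[List[str], List[str]]:
    """Return (tokens, tags) with category and POS markers woven into the text."""
    if not (len(tokens) == len(pos_tags) == len(ner_tags)):
        raise ValueError("tokens, pos_tags and ner_tags must have equal length")

    # Text-level injection: a marker token for the sentence category goes first.
    out_tokens = [f"[{sentence_label.upper()}]"]
    out_tags = [IGNORE]
    for token, pos, ner in zip(tokens, pos_tags, ner_tags):
        # Each word is followed by a marker token carrying its POS tag.
        out_tokens.extend([token, f"<{pos}>"])
        # Only the word itself keeps its NER tag; the POS marker is ignored.
        out_tags.extend([ner, IGNORE])
    return out_tokens, out_tags


tokens, tags = inject_label_and_pos(
    ["BRCA1", "causes", "cancer"],
    ["PROPN", "VERB", "NOUN"],
    ["O", "O", "B-Disease"],
    sentence_label="disease",
)
print(list(zip(tokens, tags)))
```

In the pipeline setting, the `sentence_label` argument would come from the sentence classifier's prediction; in a joint setting it would instead be a second training target rather than an input.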