A Multi-Task Approach for Improving Biomedical Named Entity Recognition by Incorporating Multi-Granularity Information.

Yiqi Tong,Yidong Chen,Xiaodong Shi
DOI: https://doi.org/10.18653/v1/2021.findings-acl.424
2021-01-01
Findings
Abstract:Neural biomedical named entity recognition (BioNER) methods usually require a large amount of annotated data, while the annotated BioNER datasets are often difficult to obtain and small in scale due to the limitations of privacy, ethics and high degree of specialization. To alleviate the lack of training samples, unlike conventional methods that only use token-level information, this paper proposes a method that simultaneously utilize the latent multi-granularity information in the dataset. Concretely, the proposed model is based on a multi-task approach, which leverages different training objectives by introducing auxiliary tasks, i.e. binary classification, multi-class and multi-token classification. Experimental results over three BioNER datasets show that the proposed model produces better performance over the BioBERT baseline and can get more than 3% improvements of F1score in low-resource scenarios. Finally, we released our code at https://github.com/ zgzjdx/MT-BioNER.
What problem does this paper attempt to address?