Multi-task Multi-attention Transformer for Generative Named Entity Recognition

Ying Mo, Jiahao Liu, Hongyin Tang, Qifan Wang, Zenglin Xu, Jingang Wang, Xiaojun Quan, Wei Wu, Zhoujun Li
DOI: https://doi.org/10.1109/taslp.2024.3458796
2024-01-01
Abstract: Most previous sequential labeling models are task-specific, while recent years have witnessed the rise of generative models, which have the advantage of unifying all named entity recognition (NER) tasks into the encoder-decoder framework. Although these models achieve promising performance, our pilot studies demonstrate that existing generative models are ineffective at detecting entity boundaries and estimating entity types. In this paper, we propose a multi-task Transformer, which incorporates an entity boundary detection task into the named entity recognition task. More concretely, we achieve entity boundary detection by classifying the relations between tokens within the sentence. To improve the accuracy of entity-type mapping during decoding, we adopt an external knowledge base to calculate the prior entity-type distributions and then incorporate the information into the model via the self- and cross-attention mechanisms. We perform experiments on extensive NER benchmarks, including flat, nested, and discontinuous NER datasets involving long entities. Across a broad spectrum of these benchmarks, our model improves F1 scores by approximately +0.3 to +1.5, or performs close to the best generative NER model. Experimental results show that our approach improves the performance of the generative NER model considerably.
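As a rough illustration of what "prior entity-type distributions" computed from an external knowledge base could look like, the sketch below counts how often each mention string appears with each type and normalizes the counts. The function name `prior_type_distributions`, the toy knowledge base, and the lowercase matching are all assumptions made for illustration; the abstract does not specify the knowledge base or the estimation procedure, and the step that feeds these priors into the attention layers is not shown.

```python
from collections import Counter, defaultdict


def prior_type_distributions(kb_entries):
    """Map each mention to a normalized distribution over entity types.

    kb_entries: iterable of (mention, entity_type) pairs.
    This is a hypothetical format, not the one used in the paper.
    """
    counts = defaultdict(Counter)
    for mention, entity_type in kb_entries:
        # Assumption: mentions are matched case-insensitively.
        counts[mention.lower()][entity_type] += 1

    priors = {}
    for mention, type_counts in counts.items():
        total = sum(type_counts.values())
        priors[mention] = {t: c / total for t, c in type_counts.items()}
    return priors


# Toy knowledge base, invented purely for this example.
toy_kb = [
    ("Washington", "PER"),
    ("Washington", "LOC"),
    ("Washington", "LOC"),
    ("Washington", "LOC"),
    ("Apple", "ORG"),
]

priors = prior_type_distributions(toy_kb)
print(priors["washington"])  # {'PER': 0.25, 'LOC': 0.75}
```

Under these assumptions, an ambiguous mention such as "Washington" receives a soft prior over types rather than a single hard label, which is the kind of signal a decoder could consult when mapping a generated span to its type.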