Chinese Named Entity Recognition Method Based on Multi-head Attention Enhancing Word Information

Ting Wang,Songze He
DOI: https://doi.org/10.1145/3548608.3559300
2022-01-01
Abstract:Chinese named entity recognition (CNER) is one of the important tasks in natural language processing. Unlike the English, Chinese lacks explicit word boundaries. Therefore, many models were designed to address this issue by incorporating word lexicon information into the CNER. However, lots of irrelevant information may be included when matching the entire lexicon for each character. Inspired by the SoftLexicon method, we propose a multi-head attention based model to simplify the introduced lexicon information to generate word-level attention vector. In this method, a word vector matched for each character is first obtained and further weighted by the relevance with the character-level vector to calculate the word-level attention vector. In this way, only the words existing in the sentence are matched, which reduces the scope of word matching. The effectiveness of this method is verified on multiple Chinese datasets.
What problem does this paper attempt to address?