Improving Chinese Named Entity Recognition Based on Lexical Information Adjustment

Jinshang Luo,Mengshu Hou
DOI: https://doi.org/10.1145/3577530.3577569
2022-01-01
Abstract:Lexical enhancement has been proven effective for Chinese Named Entity Recognition (NER), allowing for the use of word boundary information and the reduction of segmentation errors. Most present methods incorporate the lexical information by lattices but neglect the impact of candidate lexicon conflicts. The conflicts may misguide the model to make different predictions. In the work, a novel Long Short-Term Memory (LSTM) model based on the lexical information adjustment strategy (LIA-LSTM) is proposed. Firstly, all potential words are categorized into corresponding word sets. To lessen the influence of lexicon conflicts, the lexical features are introduced for the character sequence via elaborate weight adjustment. Then the LSTM and self-attention mechanism are applied to improve contextual awareness. Experiments on benchmark datasets demonstrate that LIA-LSTM outperforms the state-of-the-art methods compared. LIA-LSTM improves F1 score by 1.4% over baseline Lattice-LSTM on the MSRA dataset.
What problem does this paper attempt to address?