A Novel Chinese Resume Named Entity Recognition Model Based on Lexical Enhancement.

Jinshang Luo,Ying Liu,Mengshu Hou
DOI: https://doi.org/10.1145/3581807.3581856
2022-01-01
Abstract:The resume's popularity on the Internet has greatly increased with the development of the communication form. It is a concern of researchers to analyze the resumes of job applicants using the Named Entity Recognition (NER) method. The difficulty of Chinese Resume NER rests with word segmentation ambiguity and domain knowledge complexity. To tackle the issue, a novel lexical enhancement Long Short-Term Memory (LSTM) model with the average encoding strategy (LEAE-LSTM) is proposed. First, through the pre-trained models, the representations of characters and words are encoded separately. The lexical features with complementary information are introduced for the character sequence by matching the lexicon. Furthermore, to improve contextual awareness, the multi-metadata embeddings are combined as the input of the LSTM layer. The sentence's implicit correlations are picked up by the self-attention mechanism. Experiments on the benchmark resume dataset demonstrate that LEAE-LSTM surpasses other state-of-the-art methods. For the Chinese resume dataset, LEAE-LSTM gains a 1.8% improvement in F1 score over the baseline model Lattice LSTM.
What problem does this paper attempt to address?