A Named Entity Recognition Method Enhanced with Lexicon Information and Text Local Feature

Yuekun Ma,He Liu,Dezheng Zhang,Chang Gao,Yujue Liu
DOI: https://doi.org/10.17559/TV-20230121000257
2023-01-01
Tehnicki vjesnik - Technical Gazette
Abstract:At present, Named Entity Recognition (NER) is one of the fundamental tasks for extracting knowledge from traditional Chinese medicine (TCM) texts. The variability of the length of TCM entities and the characteristics of the language of TCM texts lead to ambiguity of TCM entity boundaries. In addition, better extracting and exploiting local features of text can improve the accuracy of named entity recognition. In this paper, we proposed a TCM NER model with lexicon information and text local feature enhancement of text. In this model, a lexicon is introduced to encode the characters in the text to obtain the context-sensitive global semantic representation of the text. The convolutional neural network (CNN) and gate joined collaborative attention network are used to form a text local feature extraction module to capture the important semantic features of local text. Experiments were conducted on two TCM domain datasets and the F1 values are 91.13% and 90.21% respectively.
What problem does this paper attempt to address?