Study on Application of Transfer Learning in Entity Recognition of Low Resource Environment

DU Peng, ZHANG Youming, ZHU Zhengzhou, LI Guocai
DOI: https://doi.org/10.3778/j.issn.1673-9418.2107097
2023-01-01
Abstract: Entity recognition is a fundamental task in information extraction, and recognizing entities in low-resource environments remains a challenging problem in natural language processing. Building on a pre-trained model, this paper adopts a "unified coding, separate decoding" approach: the model learns abstract entity boundary information from large-scale domain data and, through transfer learning, carries that boundary information over to low-resource scenarios. This effectively improves the accuracy of entity recognition in low-resource environments. Unlike existing methods, the feature vector is adapted only immediately before decoding. An adaptive module separately decodes each feature vector produced by the unified coding step according to the entity types and annotation scheme of the target domain, which together determine the output dimension for each entity; this avoids complex entity embedding problems. Experimental results on public datasets show that, compared with the BERT-BiLSTM-CRF baseline, Precision increases by 4 percentage points, Recall by 5.4 percentage points, and F1 by 4.72 percentage points in the low-resource scenario in the pharmaceutical field; in the low-resource scenario in the personnel field, Precision increases by 31.91 percentage points, Recall by 31.7 percentage points, and F1 by 31.86 percentage points. Experimental results on datasets collected and collated by the authors also show the effectiveness of the model for entity recognition in low-resource scenarios, with improved accuracy and recall compared with the Lattice-BERT model.
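The "unified coding, separate decoding" idea can be illustrated with a toy sketch. This is not the authors' implementation: the class and function names (`SharedEncoder`, `DomainDecoder`, `build_labels`) are hypothetical, the encoder is a deterministic random stand-in for a pre-trained model, the decoder weights are untrained, and the example domains and entity types are assumptions made for illustration. The sketch only shows the structural point from the abstract: one shared encoder produces the feature vectors, and each target domain gets its own decoding head whose output dimension follows from that domain's entity types and annotation scheme.

```python
import random


def build_labels(entity_types, scheme="BIO"):
    """Derive a domain's tag set from its entity types and annotation scheme."""
    prefixes = {"BIO": ["B", "I"], "BIOES": ["B", "I", "E", "S"]}[scheme]
    return ["O"] + [f"{p}-{t}" for t in entity_types for p in prefixes]


class SharedEncoder:
    """Stand-in for the unified encoder (assumption: a real system would use
    a pre-trained model here). Maps each token to a fixed-size feature vector."""

    def __init__(self, dim=8):
        self.dim = dim

    def encode(self, tokens):
        vectors = []
        for tok in tokens:
            rng = random.Random(tok)  # same token always yields the same vector
            vectors.append([rng.uniform(-1, 1) for _ in range(self.dim)])
        return vectors


class DomainDecoder:
    """Per-domain head: a linear layer sized to the domain's tag set.
    Weights are random (untrained), so predicted tags are arbitrary."""

    def __init__(self, labels, dim, seed):
        self.labels = list(labels)
        rng = random.Random(seed)
        self.weights = [[rng.uniform(-1, 1) for _ in range(dim)]
                        for _ in self.labels]

    def decode(self, features):
        tags = []
        for vec in features:
            scores = [sum(w * x for w, x in zip(row, vec))
                      for row in self.weights]
            tags.append(self.labels[scores.index(max(scores))])
        return tags


encoder = SharedEncoder(dim=8)
tokens = ["aspirin", "relieves", "pain"]
features = encoder.encode(tokens)  # encoded once, shared by every domain

# Hypothetical target domains with different entity types and schemes.
drug_decoder = DomainDecoder(build_labels(["DRUG"], "BIO"), dim=8, seed=1)
person_decoder = DomainDecoder(build_labels(["PER", "ORG"], "BIOES"), dim=8, seed=2)

drug_tags = drug_decoder.decode(features)
person_tags = person_decoder.decode(features)

print(len(drug_decoder.labels), len(person_decoder.labels))  # 3 9
```

Adding a new low-resource domain in this sketch means creating another `DomainDecoder` with its own tag set; the encoder and its feature vectors are left untouched, which mirrors the abstract's claim that adaptation happens only just before decoding.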