Named entity recognition of Chinese electronic medical records based on adversarial training and feature fusion

Dequan Zheng, Feng Yu, Haoyu Zhang
DOI: https://doi.org/10.1145/3632971.3632983
2023-07-07
Abstract: In natural language processing, named entity recognition is the foundation for tasks such as information extraction, information retrieval, and knowledge graphs. This paper proposes an entity recognition model based on adversarial training and feature fusion to address polysemy and incomplete word recognition in named entity recognition for Chinese electronic medical records. The model generates adversarial samples by adding perturbations to the word embedding layer, then trains on these samples iteratively to optimize its parameters. An improved Transformer encoder and a Bi-GRU extract global semantic information and directional information, an attention mechanism fuses the extracted context features, and a CRF produces the final entity label sequence. In addition, the RoBERTa-WWM pre-trained model serves as the embedding layer, providing character-level embeddings that capture more contextual semantic and lexical information and improve entity recognition performance. Experimental results on the CCKS2017 and CCKS2019 evaluation datasets show that the proposed model outperforms the baseline model by 0.9% and 0.74% in F1, respectively. Comparative experiments further demonstrate that adding adversarial training and feature fusion improves the model's predictive ability and robustness.
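The abstract does not say how the perturbation added to the embedding layer is computed. As an illustration only, the sketch below assumes an FGM-style (Fast Gradient Method) rule, in which the perturbation is the loss gradient with respect to the embeddings, rescaled to a fixed L2 norm `epsilon`. The function names, the list-of-rows representation, and the default `epsilon` are hypothetical and are not taken from the paper.

```python
import math


def fgm_perturbation(grad, epsilon=1.0):
    """Scale a gradient (list of rows) to L2 norm epsilon: r = epsilon * g / ||g||_2.

    Assumption: FGM-style perturbation; the paper's exact rule is not stated.
    """
    norm = math.sqrt(sum(g * g for row in grad for g in row))
    if norm == 0.0:
        # No gradient signal, so there is no direction to perturb in.
        return [[0.0 for _ in row] for row in grad]
    return [[epsilon * g / norm for g in row] for row in grad]


def perturb_embeddings(embeddings, grad, epsilon=1.0):
    """Return adversarial embeddings e + r without modifying the inputs."""
    r = fgm_perturbation(grad, epsilon)
    return [
        [e + d for e, d in zip(e_row, r_row)]
        for e_row, r_row in zip(embeddings, r)
    ]


# Toy example: two characters with 2-dimensional embeddings.
embeddings = [[0.5, -0.2], [0.1, 0.3]]
grad = [[3.0, 0.0], [0.0, 4.0]]  # hypothetical d(loss)/d(embedding)
adversarial = perturb_embeddings(embeddings, grad, epsilon=1.0)
```

In a training loop under this assumption, the model would be run once on the clean embeddings and once on `adversarial`, with the gradients from both passes accumulated before the parameter update and the original embeddings restored afterwards.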
Medicine, Computer Science