Unsupervised cross-domain named entity recognition using entity-aware adversarial training

Qi Peng, Changmeng Zheng, Yi Cai, Tao Wang, Haoran Xie, Qing Li
DOI: https://doi.org/10.1016/j.neunet.2020.12.027
IF: 7.8
2021-06-01
Neural Networks
Abstract: The success of neural network based methods in named entity recognition (NER) relies heavily on abundant manually labeled data. However, these NER methods cannot be applied when the data in a new domain is fully unlabeled. To address this problem, we propose an unsupervised cross-domain model which leverages labeled data from a source domain to predict entities in an unlabeled target domain. To reduce the distribution divergence when transferring knowledge from the source to the target domain, we apply adversarial training. Furthermore, we design an entity-aware attention module that guides the adversarial training to reduce the discrepancy of entity features between different domains. Experimental results demonstrate that our model outperforms other methods and achieves state-of-the-art performance.
computer science, artificial intelligence, neurosciences
What problem does this paper attempt to address?
The paper addresses how to use labeled data from a source domain to predict entities in a target domain whose data is completely unlabeled. Existing NER methods depend on large amounts of manually labeled data when applied to a new domain, but such data is often unavailable, especially in emerging technological fields. The paper therefore proposes an unsupervised cross-domain model that combines adversarial training with an entity-aware attention mechanism. The aim is to reduce the distribution differences between domains and to align entity features at a fine-grained level, thereby improving entity recognition performance in the target domain.

The main contributions of the paper include:
1. Proposing an entity-aware adversarial training model for unsupervised cross-domain NER, which addresses limitations of existing work, including the dependence on external resources and the domain distribution shift problem.
2. Using adversarial training to reduce domain distribution shift, and introducing an entity-aware attention mechanism to alleviate the misalignment of entity features during adversarial training.
3. Evaluating the proposed method through experiments on three datasets. The experimental results show that the model outperforms other existing methods.

By combining adversarial training with entity-aware attention, the paper addresses a key challenge in cross-domain NER: transferring knowledge from the source domain to the target domain without additional large-scale language resources or domain dictionaries.
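
To make the interplay of the two mechanisms concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It assumes that each token has a scalar "entity score" from an attention module and that a domain discriminator outputs, per token, the probability that the token comes from the source domain. The function names (`softmax`, `entity_weighted_domain_loss`, `reversed_gradient`) and the exact loss form are hypothetical; the summary above does not specify the paper's formulation.

```python
import math


def softmax(scores):
    """Normalize raw attention scores into weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def entity_weighted_domain_loss(entity_scores, source_probs, is_source):
    """Attention-weighted binary cross-entropy of a domain discriminator.

    entity_scores: per-token scores from a (hypothetical) entity-aware
                   attention module; higher means "more entity-like".
    source_probs:  per-token discriminator outputs, P(token is from source).
    is_source:     True if the sentence really comes from the source domain.
    """
    weights = softmax(entity_scores)
    eps = 1e-12  # guard against log(0)
    loss = 0.0
    for w, p in zip(weights, source_probs):
        p_true = p if is_source else 1.0 - p
        loss -= w * math.log(max(p_true, eps))
    return loss


def reversed_gradient(grad, lam=1.0):
    """Gradient reversal: the feature extractor receives the discriminator's
    gradient scaled by -lam, so it is pushed toward domain-invariant features."""
    return [-lam * g for g in grad]


# Entity-like tokens dominate the loss, so alignment focuses on them.
uniform = entity_weighted_domain_loss([0.0, 0.0], [0.9, 0.1], True)
focused = entity_weighted_domain_loss([5.0, 0.0], [0.9, 0.1], True)
print(focused < uniform)
```

Under these assumptions, the discriminator is trained to minimize the weighted loss while the feature extractor, through the reversed gradient, is trained to increase it. The attention weights concentrate that pressure on entity tokens, which is one way to read the "fine-grained entity feature alignment" described above.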