E-bidNER: A Two-stage Enhanced Named Entity Recognition for Chinese Bid Announcements

Yuhang Chen,Di Yang,Peng Wang,Cheng Gao
DOI: https://doi.org/10.1109/iaecst60924.2023.10502612
2023-01-01
Abstract:Addressing the issues of scarce corpora, semantic sparsity, and polysemy in the field of tender announcements, this paper proposes a bid announcement domain named entity recognition model based on a two-stage enhancement. The model consists of two core stages: context-aware fine-tuning (CFT) and multi-dimensional context semantic synthesis (MDCS). In the CFT stage, a domain-specific dataset is constructed, and ERNIE model is fine-tuned using a multi-stage masking strategy to acquire context-specific information in the tender announcement domain. Subsequently, in the MDCS stage, features are dynamically fused using a bidirectional long short-term memory network (BiLSTM) and a multi-head self-attention mechanism to capture the dependency relationships between complex entities in long texts. Experimental results demonstrate that our proposed model outperforms existing state-of-the-art models in named entity recognition tasks within the tender announcement domain. Additionally, ablation experiments further confirm the indispensability of each component in the two-stage strategy.
What problem does this paper attempt to address?