HDMacBERT-FGM-CRF Model-Based Chinese Event Element Extraction

Peng Chen, Baohua Qiang, Yufeng Wang, Chen Wang, Ruidong Chen, Baolian Li, Bo Peng, Xiangyu Zhou
DOI: https://doi.org/10.1007/978-981-16-8430-2_36
2022-01-01
Abstract: Current Chinese event element extraction models suffer from inaccurate extraction on long texts, errors arising between the pre-training and fine-tuning stages, and limited diversity of semantic feature samples. To address these issues, this paper proposes HDMacBERT-FGM-CRF, a Chinese event element extraction algorithm that combines Hierarchical Decomposition MacBERT and the Fast Gradient Method (FGM) with a Conditional Random Field (CRF). The model extends absolute position encoding through hierarchical decomposition so that HDMacBERT can directly process texts longer than 512 tokens. HDMacBERT's similar-word masking strategy is used to mitigate the pre-training and fine-tuning errors. FGM-based adversarial training increases the diversity of semantic feature samples. On the DuEE dataset, the precision (P), recall (R), and \(F_1\) of HDMacBERT-FGM-CRF are 2.52%, 4.03%, and 3.26% higher, respectively, than those of the classical Chinese event element extraction model BERT-CRF. The experimental results show that the proposed model learns more comprehensive deep semantic information from text and improves Chinese event element extraction.
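The FGM step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used formulation in which the embedding is perturbed along the L2-normalized loss gradient, r_adv = epsilon * g / ||g||_2. The toy linear scorer, the squared-error loss, the value of epsilon, and all function names below are hypothetical and chosen only for illustration; the abstract does not give the paper's actual configuration.

```python
import math


def fgm_perturbation(grad, epsilon=1.0):
    """Return r_adv = epsilon * g / ||g||_2 (assumed standard FGM form)."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm == 0.0:
        # No gradient direction available, so apply no perturbation.
        return [0.0 for _ in grad]
    return [epsilon * g / norm for g in grad]


def toy_loss(embedding, weights, label):
    """Hypothetical squared-error loss of a linear scorer (a stand-in model)."""
    score = sum(e * w for e, w in zip(embedding, weights))
    return 0.5 * (score - label) ** 2


def toy_loss_grad(embedding, weights, label):
    """Analytic gradient of toy_loss with respect to the embedding."""
    score = sum(e * w for e, w in zip(embedding, weights))
    return [(score - label) * w for w in weights]


# Illustrative values only; none of these numbers come from the paper.
embedding = [0.2, -0.1, 0.4]
weights = [1.0, 2.0, -1.0]
label = 1.0
epsilon = 0.5

grad = toy_loss_grad(embedding, weights, label)
r_adv = fgm_perturbation(grad, epsilon)
adv_embedding = [e + r for e, r in zip(embedding, r_adv)]

clean_loss = toy_loss(embedding, weights, label)
adv_loss = toy_loss(adv_embedding, weights, label)
print(f"clean loss: {clean_loss:.4f}, adversarial loss: {adv_loss:.4f}")
```

In adversarial training of this kind, the loss on the perturbed embedding is typically added to the clean loss before the parameter update, so the model sees a harder variant of each sample. Whether the paper combines the two losses in exactly this way is not stated in the abstract.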