A Multi-Channel Graph Attention Network for Chinese NER.

Yichun Zhao, Kui Meng, Gongshen Liu
DOI: https://doi.org/10.1007/978-3-030-92185-9_17
2021-01-01
Abstract: Incorporating lexicons into Chinese NER via lattice inputs has proven effective because it exploits both the boundary and the semantic information of words. Previous works have accommodated lattice inputs with a variety of structures, including RNNs, CNNs, GNNs, and Transformers, with great success. In this paper, we propose a Multi-Channel Graph Attention Network (MCGAT), which consists of three word-modified graph attention networks. MCGAT considers the relative position relations between characters and words, and combines word-frequency and pointwise mutual information statistics to further improve the performance of the model. Experiments on four datasets show that our model achieves comprehensive improvements over the original model and outperforms the state-of-the-art model on three datasets when using the same lexicon. MCGAT is competitive in terms of efficiency and has a faster inference speed than the Transformer-based model. We also demonstrate that MCGAT, used as a downstream network, can significantly improve the performance of pre-trained models such as BERT.
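The abstract names pointwise mutual information (PMI) as one of the statistics MCGAT draws on but does not say how it is computed or fed into the graph. As a rough illustration only, the sketch below computes the textbook quantity PMI(a, b) = log(p(a, b) / (p(a) p(b))) for adjacent character pairs in a toy corpus. The function name `char_pmi`, the choice of adjacent-character pairs, and the sample strings are assumptions made for this example, not details taken from the paper.

```python
import math
from collections import Counter


def char_pmi(sentences):
    """Return PMI for each adjacent character pair in a list of strings.

    Hypothetical helper for illustration; it is not the paper's code.
    Probabilities are plain maximum-likelihood estimates from counts.
    """
    unigrams = Counter()
    bigrams = Counter()
    for s in sentences:
        unigrams.update(s)              # count single characters
        bigrams.update(zip(s, s[1:]))   # count adjacent character pairs
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())

    scores = {}
    for (a, b), count in bigrams.items():
        p_ab = count / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        scores[(a, b)] = math.log(p_ab / (p_a * p_b))
    return scores


# Toy corpus (assumed data): pairs that co-occur more often than chance
# would predict receive a positive score.
toy_scores = char_pmi(["南京市", "南京", "北京市"])
```

A high PMI suggests that two characters tend to appear together, which is one plausible signal for word boundaries. How MCGAT actually combines such scores with word frequency inside its attention channels is described in the full paper, not in this sketch.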