Incorporating Dictionary-Based Word Representation into Neural Network for Sequence Tagging

Weilin Liu
DOI: https://doi.org/10.1007/978-981-10-8944-2_52
2018-08-23
Abstract: Traditional sequence labeling systems rely on extensive specialized knowledge and hand-crafted features. State-of-the-art sequence tagging models instead combine bidirectional LSTM, CNN, and CRF, and require no feature engineering or data processing. In this paper, we propose an LSTM–CRF–DICT network that incorporates dictionary-based word representations into the neural network. A trie is used to extract the dictionary-based word features, which speeds up retrieval. We evaluated our method on a Chinese corpus for sequence tagging with nine tags and five domain dictionaries, and tagging performance improved significantly. In addition, introducing the dictionaries adds little to the running time.
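The abstract does not spell out how dictionary features are encoded, so the following is only a minimal sketch of trie-based dictionary lookup, not the paper's implementation. It assumes one binary feature per dictionary for each character, set to 1 when the character is covered by any entry of that dictionary. The names `Trie`, `matches_from`, and `dictionary_features` are hypothetical.

```python
from typing import Dict, Iterator, List, Set, Tuple


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        # Ids of the dictionaries that contain the word ending at this node.
        self.dict_ids: Set[int] = set()


class Trie:
    """A character-level trie shared by all dictionaries."""

    def __init__(self) -> None:
        self.root = TrieNode()

    def insert(self, word: str, dict_id: int) -> None:
        if not word:
            return  # ignore empty entries
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.dict_ids.add(dict_id)

    def matches_from(self, text: str, start: int) -> Iterator[Tuple[int, int]]:
        """Yield (end, dict_id) for every entry that begins at `start`.

        `end` is exclusive. The walk stops as soon as no entry shares the
        current prefix, so each start position costs at most the length
        of the longest dictionary entry.
        """
        node = self.root
        for i in range(start, len(text)):
            node = node.children.get(text[i])
            if node is None:
                return
            for dict_id in node.dict_ids:
                yield i + 1, dict_id


def dictionary_features(text: str, trie: Trie, num_dicts: int) -> List[List[int]]:
    """Return one binary vector per character (assumed encoding, see lead-in)."""
    feats = [[0] * num_dicts for _ in text]
    for start in range(len(text)):
        for end, dict_id in trie.matches_from(text, start):
            for i in range(start, end):
                feats[i][dict_id] = 1
    return feats
```

For instance, after inserting "北京" into dictionary 0 and "北京大学" into dictionary 1, calling `dictionary_features("我在北京大学", trie, 2)` marks "北" and "京" for both dictionaries and "大" and "学" for dictionary 1 only. In a tagger of the kind described, vectors like these would be concatenated with the learned character or word embeddings before the LSTM layer; that wiring is likewise an assumption here.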