Attention-Based Recurrent Neural Network for Sequence Labeling.

Bofang Li, Tao Liu, Zhe Zhao, Xiaoyong Du
DOI: https://doi.org/10.1007/978-3-319-96890-2_28
2018-01-01
Abstract: Sequence labeling is one of the key problems in natural language processing. Recently, Recurrent Neural Networks (RNNs) and their variants have been widely used for this task. Despite their ability to encode long-distance information, in practice a single hidden layer is still not sufficient for prediction. In this paper, we propose an attention architecture for sequence labeling, which allows RNNs to selectively focus on useful hidden layers instead of irrelevant ones. We conduct experiments on four typical sequence labeling tasks: Part-Of-Speech Tagging (POS), Chunking, Named Entity Recognition (NER), and Slot Filling for Spoken Language Understanding (SF-SLU). Comprehensive experiments show that our attention architecture provides consistent improvements across different RNN variants.
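The idea of attending over hidden layers when labeling each token can be illustrated with a minimal sketch. The abstract does not specify the scoring function or how the attended context is combined with the current state, so the dot-product scoring, the concatenation step, and the names `attend` and `label_features` are all assumptions made for illustration, not the authors' architecture. The hidden states are toy values standing in for the output of some RNN.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(hidden_states, t):
    """Return (context, weights) for position t.

    hidden_states: list of equal-length float vectors, one per token,
    as produced by some RNN (not modeled here).
    Scoring is a plain dot product; this is an assumption of the sketch.
    """
    query = hidden_states[t]
    scores = [sum(q * h for q, h in zip(query, state)) for state in hidden_states]
    weights = softmax(scores)
    dim = len(query)
    # Weighted sum of all hidden states, one dimension at a time.
    context = [
        sum(w * state[d] for w, state in zip(weights, hidden_states))
        for d in range(dim)
    ]
    return context, weights


def label_features(hidden_states):
    """Concatenate each hidden state with its attention context (assumed design)."""
    return [
        hidden_states[t] + attend(hidden_states, t)[0]
        for t in range(len(hidden_states))
    ]


# Toy hidden states for a three-token sentence (made-up values).
H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
features = label_features(H)
```

Each entry of `features` could then be passed to a per-token classifier. States that score higher against the current position contribute more to its context vector, which is the sense in which the model focuses on useful states and down-weights the rest.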