Hierarchical Attention Blstm For Modeling Sentences And Documents

Xiaolei Niu,Yuexian Hou
DOI: https://doi.org/10.1007/978-3-319-70096-0_18
2017-01-01
Abstract:Recently, neural network based methods have made remarkable progresses on various Natural Language Processing (NLP) tasks. However, it is still a challenge to model both short and long texts, e.g. sentences and documents. In this paper, we propose a Hierarchical Attention Bidirectional LSTM (HA-BLSTM) to model both sentences and documents. HA-BLSTM effectively obtains a hierarchy of representations from words to phrases through the hierarchical structure. We design two attention mechanisms: local and global attention mechanisms. The local attention mechanism learns which components of a text are more important for modeling the whole text, while the global attention mechanism learns which representations of the same text are crucial. Thus, HA-BLSTM can model long documents along with short sentences. Experiments on four benchmark datasets show that our model yields a superior classification performance over a number of strong baselines.
What problem does this paper attempt to address?