Recurrent Neural Networks with External Addressable Long-Term and Working Memory for Learning Long-Term Dependences.

Zhibin Quan, Weili Zeng, Xuelian Li, Yandong Liu, Yunxiu Yu, Wankou Yang
DOI: https://doi.org/10.1109/tnnls.2019.2910302
IF: 14.255
2020-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract: Learning long-term dependences (LTDs) with recurrent neural networks (RNNs) is challenging due to their limited internal memories. In this paper, we propose a new external memory architecture for RNNs called an external addressable long-term and working memory (EALWM)-augmented RNN. This architecture has two distinct advantages over existing neural external memory architectures: the external memory is divided into two parts, long-term memory and working memory, both of which are addressable; and, under the necessary assumptions, it can learn LTDs without suffering from vanishing gradients. The experimental results on algorithm learning, language modeling, and question answering demonstrate that the proposed neural memory architecture is promising for practical applications.