Subtraction Gates: Another Way to Learn Long-Term Dependencies in Recurrent Neural Networks

Tao He,Hua Mao,Zhang Yi
DOI: https://doi.org/10.1109/TNNLS.2020.3043752
IF: 14.255
2022-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract:Recurrent neural networks (RNNs) can remember temporal contextual information over various time steps. The well-known gradient vanishing/explosion problem restricts the ability of RNNs to learn long-term dependencies. The gate mechanism is a well-developed method for learning long-term dependencies in long short-term memory (LSTM) models and their variants. These models usually take the multiplica...
What problem does this paper attempt to address?