Abstract:Sequential recommendation is a stream of studies on recommender systems, which focuses on predicting the next item a user interacts with by modeling the dynamic sequence of user-item interactions. Since being born to explore the dynamic tendency of variable-length temporal sequence, Recurrent Neural Networks (RNNs) have been paid much attention in this area. However, the inherent defects caused by the network structure of RNNs have limited their applications in sequential recommendation, which are mainly shown on two factors: RNNs tend to make point-wise predictions and ignore the collective dependencies because the temporal relationships between items change monotonically; RNNs are likely to forget the essential information during processing long sequences. To solve these problems, researchers have done much work to enhance the memory mechanism of RNNs. However, although previous RNN-based methods have achieved promising performance by taking advantage of external knowledge with other advanced techniques, the improvement of the intrinsic property of existing RNNs has not been explored, which is still challenging. Therefore, in this work, we propose a novel architecture based on Long Short-Term Memories (LSTMs), a broadly-used variant of RNNs, specific for sequential recommendation, called Long Short-Term enhanced Memory (LSTeM) , which boosts the memory mechanism of original LSTMs in two ways. Firstly, we design a new structure of gates in LSTMs by introducing a "Q-K-V" triplet, a mechanism to accurately and properly model the correlation between the current item and the user's historical behaviors at each time step. Secondly, we propose a "recover gate" to remedy the inadequacy of memory caused by the forgetting mechanism, which works with a dynamic global memory embedding. Extensive experiments have demonstrated that LSTeM achieves comparable performance to the state-of-the-art methods on the challenging datasets for sequential recommendation.

Nested LSTMs

Residual Recurrent Neural Networks for Learning Sequential Representations.

Cell-aware Stacked LSTMs for Modeling Sentences

xLSTM: Extended Long Short-Term Memory

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

A Modified Long Short-Term Memory Cell

Learning Longer Memory in Recurrent Neural Networks

Neural Architectures for Nested NER through Linearization

Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers

Gated Recurrent Neural Tensor Network

Major–Minor Long Short-Term Memory for Word-Level Language Model

Learning Hierarchical Structures with Differentiable Nondeterministic Stacks

Design of Hierarchical Neural Networks Using Deep LSTM and Self-organizing Dynamical Fuzzy-Neural Network Architecture

Hierarchically Gated Recurrent Neural Network for Sequence Modeling

Learning Sparse Hidden States In Long Short-Term Memory

Recurrent Memory Networks for Language Modeling

Modeling programs hierarchically with stack-augmented LSTM

Long short-term enhanced memory for sequential recommendation

Continual Learning Long Short Term Memory.

Short-term Memory of Deep RNN