Attention distribution guided information transfer networks for recommendation in practice
Gang Sun,Yu Li,Hongfang Yu,Victor Chang
DOI: https://doi.org/10.1016/j.asoc.2020.106772
IF: 8.7
2020-12-01
Applied Soft Computing
Abstract:<p>Recently, an increasing number of deep learning-based methods have been applied to recommendation. Most of these methods outperform traditional methods, especially when they apply natural language processing (NLP) techniques to review texts. Many deep learning-based recommender systems learn latent representations of the reviews written by the target user and of the reviews written for the target item, and then combine these representations to predict the rating the target user would give the target item. However, most previously proposed review-based deep learning methods do not conform to real-world application scenarios, in which the review written by the target user for the target item (called the U2I review) cannot be obtained. In real-world recommendation settings, items are always recommended to users before the users have experienced them, so the U2I review is not available during validation and testing. Many methods, such as DeepCoNN and D-ATT, do not exclude the U2I review during validation and testing. As a result, their testing process differs from real-world application scenarios, and they obtain substantial valuable information from the U2I review itself. To solve this problem, we propose a model called ADGITN (Attention Distribution Guided Information Transfer Networks) together with a training strategy. During training, an auxiliary model uses auxiliary tasks to learn two attention distributions: that of the U2I review over the user reviews and that of the U2I review over the item reviews. These two distributions guide the main model as it learns its own attention distributions between user reviews and item reviews. The main model thus learns to extract attention distributions between user reviews and item reviews according to the valuable information extracted from U2I reviews. During validation, only the main model is used, and it can extract better attention distributions even without the help of a U2I review.
Extensive experiments on the Amazon and Yelp19 datasets show the effectiveness of our model: it outperforms strong existing models, with up to 13.8% relative improvement over MPCN, one of the best review-based deep learning models for recommendation.</p>
computer science, artificial intelligence, interdisciplinary applications
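The training strategy described in the abstract, in which attention distributions learned with access to the U2I review guide the main model's attention, can be illustrated with a minimal sketch. The abstract does not state the loss, so everything here is an assumption made for illustration: the KL-divergence guidance term, the squared-error rating term, the `guide_weight` parameter, and all function names are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Turn raw relevance scores into an attention distribution."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(teacher, student, eps=1e-12):
    """KL(teacher || student) for two distributions over the same reviews."""
    return sum(
        t * math.log((t + eps) / (s + eps)) for t, s in zip(teacher, student)
    )


def training_loss(pred, target, aux_attn, main_attn, guide_weight=0.5):
    """Squared rating error plus one guidance term per attention pair.

    aux_attn and main_attn are parallel lists of distributions, for example
    [user-side attention, item-side attention]. The KL term is an assumed
    stand-in for whatever guidance objective the paper actually uses.
    """
    rating_loss = (pred - target) ** 2
    guidance = sum(kl_divergence(a, m) for a, m in zip(aux_attn, main_attn))
    return rating_loss + guide_weight * guidance


# Toy scores over three user reviews and two item reviews.
aux_user = softmax([2.0, 0.5, 0.1])   # auxiliary model, has seen the U2I review
main_user = softmax([1.0, 0.8, 0.3])  # main model, no U2I review
aux_item = softmax([0.2, 1.5])
main_item = softmax([0.4, 0.9])

loss = training_loss(4.2, 5.0, [aux_user, aux_item], [main_user, main_item])
print(f"training loss: {loss:.4f}")
```

In this sketch the guidance term is zero when the main model's attention already matches the auxiliary model's, so only the rating error remains. At validation time the auxiliary inputs would simply be dropped, which mirrors the abstract's statement that only the main model is used.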