SGAT: Scene Graph Attention Network for Video Recommendation

Xiaomeng Wang, Tong Xu, Shiwei Wu
DOI: https://doi.org/10.1145/3591156.3591173
2023-01-01
Abstract: As a widely studied topic in recommender systems, collaborative filtering (CF) methods help users discover potential items of interest by assuming that behaviorally similar users have similar preferences for items. A recent trend is to develop models based on graph neural networks (GNNs). One limitation of existing methods is that they predict user preferences only by modeling user-item interactions, failing to identify user-item relations at the finer-grained level of semantics. For more accurate and explainable recommendation, it is necessary to take side information into account and model user-item relations at the granularity of semantics. In this paper, we investigate the utility of semantic scene graphs, which provide detailed, graph-based annotations of the social situations depicted in movie clips, in the video recommendation scenario. We propose a new method named Scene Graph Attention Network (SGAT), which explicitly models user-item relations at the granularity of semantics by capturing the aspects of particular interest to the user from the semantic scene graphs of historical items. Empirical results on public datasets show that SGAT significantly outperforms state-of-the-art methods such as LightGCN [2] and DGCF [3]. Further studies verify the effectiveness of SGAT and the explainability of its predictions.
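The idea of attending over scene-graph elements can be illustrated with a minimal sketch. It is not the SGAT architecture, which the abstract does not specify. It assumes plain dot-product attention, and the function names, node labels, and toy embeddings are all hypothetical. It shows only how attention weights over scene-graph nodes can yield both a preference score and a hint about which aspect drove it.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def attend(user_vec, node_vecs):
    """Weight node embeddings by dot-product affinity to the user (assumed attention form)."""
    weights = softmax([dot(user_vec, n) for n in node_vecs])
    dim = len(user_vec)
    pooled = [sum(w * n[i] for w, n in zip(weights, node_vecs)) for i in range(dim)]
    return weights, pooled

# Hypothetical toy embeddings for one user and one clip's scene-graph nodes.
user = [1.0, 0.0]
scene_nodes = {
    "person:laughing": [2.0, 0.0],
    "place:office": [0.0, 2.0],
    "relation:argues_with": [0.0, 0.0],
}
labels = list(scene_nodes)

weights, item_repr = attend(user, [scene_nodes[k] for k in labels])
score = dot(user, item_repr)                      # preference score for this clip
top_aspect = labels[weights.index(max(weights))]  # aspect with the largest weight
```

In this toy setup the attention weights double as a rough explanation: the node with the largest weight is the aspect the score depends on most.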