<i>M</i><SUP>3</SUP>-IB: A Memory-Augment Multi-modal Information Bottleneck Model for Next-Item Recommendation

Yingpeng Du, Hongzhi Liu, Zhonghai Wu
DOI: https://doi.org/10.1007/978-3-031-00126-0_2
2022-01-01
Abstract: Modeling users and items is essential for accurate recommendation. Traditional methods rely only on users' behavior data. Several recent methods use multi-modal data (e.g., items' attributes and visual features) to better model users and items. However, these methods fail to model users' dynamic and personalized preferences on different modalities. In addition, besides information useful for recommendation, multi-modal data also contains a great deal of irrelevant and redundant information that may mislead the learning of recommendation models. To solve these problems, we propose a Memory-augment Multi-Modal Information Bottleneck method, named <i>M</i><SUP>3</SUP>-IB, for next-item recommendation. First, we design a memory network framework to maintain modality-specific knowledge and capture users' dynamic modality-specific preferences. Second, we propose to model and fuse users' personalized preferences on different modalities with a multi-modal probabilistic graph. Then, to filter out irrelevant and redundant information in multi-modal data, we extend the information bottleneck theory from the single-modal to the multi-modal scenario and design a multi-modal information bottleneck (M2IB) model. Finally, we provide a variational approximation and a flexible implementation of the M2IB model for next-item recommendation. Experimental results on five real-world data sets demonstrate the promise of the proposed method.
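
For context, the single-modal information bottleneck principle that the abstract refers to is commonly written as the objective below. This is the standard textbook form, not the paper's multi-modal formulation, which the abstract does not give. The symbols are assumptions introduced here for illustration: X is the input, Y the prediction target (for example, the next item), Z the learned representation, and β > 0 a trade-off weight.

```latex
% Standard single-modal information bottleneck objective (assumed notation):
% compress the input X into Z while keeping Z informative about the target Y.
\min_{p(z \mid x)} \; I(X; Z) \;-\; \beta \, I(Z; Y)
```

Minimizing I(X; Z) discards irrelevant and redundant input information, while the term weighted by β rewards keeping what predicts Y.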