CMC-MMR: multi-modal recommendation model with cross-modal correction
YuBin Wang,HongBin Xia,Yuan Liu
DOI: https://doi.org/10.1007/s10844-024-00848-x
2024-02-21
Journal of Intelligent Information Systems
Abstract:Multi-modal recommendation using multi-modal features (e.g., image and text features) has received significant attention and has been shown to have more effective recommendation. However, there are currently the following problems with multi-modal recommendation: (1) Multi-modal recommendation often handle individual modes' raw data directly, leading to noise affecting the model's effectiveness and the failure to explore interconnections between modes; (2) Different users have different preferences. It's impractical to treat all modalities equally, as this could interfere with the model's ability to make recommendation. To address the above problems, this paper proposes a M ulti- m odal r ecommendation model with c ross- m odal c orrection (CMC-MMR). Firstly, in order to reduce the effect of noise in the raw data and to take full advantage of the relationships between modes, we designed a cross-modal correction module to denoise and correct the modes using a cross-modal correction mechanism; Secondly, the similarity between the same modalities of each item is used as a benchmark to build item-item graphs for each modality, and user-item graphs with degree-sensitive pruning strategies are also built to mine higher-order information; Finally, we designed a self-supervised task to adaptively mine user preferences for modality. We conducted comparative experiments with eleven baseline models on four real-world datasets. The experimental results show that CMC-MMR improves 6.202%, 4.975% , 6.054% and 11.368% on average on the four datasets, respectively, demonstrates the effectiveness of CMC-MMR.
computer science, information systems, artificial intelligence