Multi-level cross-modal contrastive learning for review-aware recommendation

Yibiao Wei,Yang Xu,Lei Zhu,Jingwei Ma,Chengmei Peng
DOI: https://doi.org/10.1016/j.eswa.2024.123341
IF: 8.5
2024-08-01
Expert Systems with Applications
Abstract:Recent studies tend to employ Contrastive Learning (CL) methods to facilitate model training by extracting self-supervised signals to mitigate data sparsity. However, existing CL-based recommendation methods have not fully exploited the rich semantic information present in multi-modal data. To address these limitations, we propose a new CL-based recommendation framework named Multi-level Cross-modal Contrastive Learning (MCCL), which aims to construct multi-level contrastive learning to fully exploit the intra- and inter-modal semantic information in a self-supervised manner. Specifically, we innovatively consider user interaction and semantic review as two distinct semantic modalities, and devise two modal-specific contrastive learning strategies to enhance intra-modal learning. Furthermore, we leverage the semantic consistency between modalities to construct a multi-level cross-modal contrastive learning framework. Finally, a multi-task learning method is employed for collaborative optimization across multiple tasks. We verify the efficacy of MCCL via comprehensive experiments on three real-world datasets. MCCL achieves a significant performance improvement over the state-of-the-art baseline models.
computer science, artificial intelligence,engineering, electrical & electronic,operations research & management science
What problem does this paper attempt to address?