Abstract:Multimedia-based recommendation is a challenging task that requires not only learning collaborative signals from user-item interaction, but also capturing modality-specific user interest clues from complex multimedia content. Though significant progress on this challenge has been made, we argue that current solutions remain limited by multimodal noise contamination. Specifically, a considerable proportion of multimedia content is irrelevant to the user preference, such as the background, overall layout, and brightness of images; the word order and semantic-free words in titles; etc. We take this irrelevant information as noise contamination to discover user preferences. Moreover, most recent research has been conducted by graph learning. This means that noise is diffused into the user and item representations with the message propagation; the contamination influence is further amplified. To tackle this problem, we develop a novel framework named Multimodal Graph Contrastive Learning (MGCL), which captures collaborative signals from interactions and uses visual and textual modalities to respectively extract modality-specific user preference clues. The key idea of MGCL involves two aspects: First, to alleviate noise contamination during graph learning, we construct three parallel graph convolution networks to independently generate three types of user and item representations, containing collaborative signals, visual preference clues, and textual preference clues. Second, to eliminate as much preference-independent noisy information as possible from the generated representations, we incorporate sufficient self-supervised signals into the model optimization with the help of contrastive learning, thus enhancing the expressiveness of the user and item representations. Extensive experiments validate the effectiveness and scalability of MGCL at https://github.com/hfutmars/MGCL .

M 3 KGR: A Momentum Contrastive Multi-Modal Knowledge Graph Learning Framework for Recommendation

Meta Concept Recommendation Based on Knowledge Graph

Multi-modal Recommendation Based on Knowledge Graph

Multi-contrastive Learning Recommendation Combined with Knowledge Graph

Multi-level Cross-view Contrastive Learning for Knowledge-aware Recommender System

Enhanced knowledge graph recommendation algorithm based on multi-level contrastive learning

MMGCL: Meta Knowledge-Enhanced Multi-view Graph Contrastive Learning for Recommendations

Exploring Multi-dimension User-Item Interactions with Attentional Knowledge Graph Neural Networks for Recommendation

Multi-Task Feature Learning for Knowledge Graph Enhanced Recommendation

MKGPC: Multimodal Knowledge Graph Propagation for Recommendation Systems

Multimodal Graph Contrastive Learning for Multimedia-Based Recommendation

Attentive Knowledge-aware Graph Convolutional Networks with Collaborative Guidance for Personalized Recommendation

Knowledge Enhancement for Contrastive Multi-Behavior Recommendation

Multi-task Feature Learning for Social Recommendation

MMKDGAT: Multi-modal Knowledge graph-aware Deep Graph Attention Network for remote sensing image recommendation

Enhanced Multi-Task Learning and Knowledge Graph-Based Recommender System

MERGE: A Modal Equilibrium Relational Graph Framework for Multi-Modal Knowledge Graph Completion

Knowledge-Aware Multi-Intent Contrastive Learning for Multi-Behavior Recommendation

Multi-knowledge enhanced graph convolution for learning resource recommendation

Multitype view of knowledge contrastive learning for recommendation

Multimodal collaborative graph for image recommendation