Abstract:There is a soaring interest in the news recommendation research scenario due to the information overload. To accurately capture users' interests, we propose to model multi-modal features, in addition to the news titles that are widely used in existing works, for news recommendation. Besides, existing research pays little attention to the click decision-making process in designing multi-modal modeling modules. In this work, inspired by the fact that users make their click decisions mostly based on the visual impression they perceive when browsing news, we propose to capture such visual impression information with visual-semantic modeling for news recommendation. In this paper, we refer to visual impression as the region of the news displayed on the user interface of a news application, which delivers both content and layout information to users. Specifically, we devise the local impression modeling module to simultaneously attend to decomposed details in the impression when understanding the semantic meaning of news title, which could explicitly get close to the process of users reading news. In addition, we inspect the impression from a global view and take structural information, such as the arrangement of different fields and spatial position of different words on the impression, into the modeling of multiple modalities. To accommodate the research of visual impression-aware news recommendation, we extend the text-dominated news recommendation dataset MIND by adding snapshot impression images and will release it to nourish the research field. Extensive comparisons with the state-of-the-art news recommenders along with the in-depth analyses demonstrate the effectiveness of the proposed method and the promising capability of modeling visual impressions for the content-based recommenders.

MM-Rec: Visiolinguistic Model Empowered Multimodal News Recommendation

MM-Rec: Multimodal News Recommendation

VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation

Why DoWe Click: Visual Impression-aware News Recommendation

Why Do We Click

Neural News Recommendation with Attentive Multi-View Learning

Neural News Recommendation with Multi-Head Self-Attention.

News Recommendation With Word-Related Joint Topic Prediction

News Recommendation with Candidate-aware User Modeling

User Modeling with Click Preference and Reading Satisfaction for News Recommendation

Empowering News Recommendation with Pre-trained Language Models

Neural News Recommendation with Collaborative News Encoding and Structural User Encoding

NoteLLM-2: Multimodal Large Representation Models for Recommendation

Multi-level News Recommendation Via Modeling Candidate Interactions

Graph neural news recommendation based on multi-view representation learning

WG4Rec: Modeling Textual Content withWord Graph for News Recommendation

Neural News Recommendation with Heterogeneous User Behavior

MMGRec: Multimodal Generative Recommendation with Transformer Model

Two Birds with One Stone: Unified Model Learning for Both Recall and Ranking in News Recommendation

MNN4Rec: A relation-aware approach based on multi-view news network for news recommendation

Neural News Recommendation with Topic-Aware News Representation