Multi-Interest Learning for Multi-Modal Paper Recommendation.

Xiaoteng Shen,Liangcai Su,Xi Xiao,Yi Li
DOI: https://doi.org/10.1109/ICASSP48485.2024.10446181
2024-01-01
Abstract:To help researchers find papers of interest quickly and accurately, paper recommendations are widely deployed. However, they face unique challenges, including the exploitation of rich multi-modal features and the modeling of complex relationships. In terms of multi-modal features, previous work has focused on textual information but ignored visual information, as paper images may be missing or their meanings are highly domain specific. On the other hand, although conventional recommendation methods are suitable for modeling the user-paper relationship, the lack of specific designs for the paper-paper relationship leads to sub-optimal results. To overcome these limitations, we propose a Multi-interest based Multi-modal paper recommendation model named TMRec. TMRec utilizes screenshots of papers as visual impressions to capture paper style features while avoiding the dilemma of missing paper images. Additionally, we have specially designed the multi-interest extraction module and the multi-level interaction module to consider both the multi-interest and the citation relationship between papers in the user behavior sequence. Compared to various strong baselines, TMRec has a relative improvement for up to 20% in terms of recall rate on real-world datasets, which demonstrates the superiority of TMRec and effectiveness of the visual impression feature.
What problem does this paper attempt to address?