Abstract:As of today, most movie recommendation services base their recommendations on collaborative filtering (CF) and/or content-based filtering (CBF) models that use metadata (e.g., genre or cast). In most video-on-demand and streaming services, however, new movies and TV series are continuously added. CF models are unable to make predictions in such a scenario, since the newly added videos lack interactions—a problem technically known as new item cold start (CS). Currently, the most common approach to this problem is to switch to a purely CBF method, usually by exploiting textual metadata. This approach is known to have lower accuracy than CF because it ignores useful collaborative information and relies on human-generated textual metadata, which are expensive to collect and often prone to errors. User-generated content, such as tags, can also be rare or absent in CS situations. In this paper, we introduce a new movie recommender system that addresses the new item problem in the movie domain by (i) integrating state-of-the-art audio and visual descriptors, which can be automatically extracted from video content and constitute what we call the movie genome; (ii) exploiting an effective data fusion method named canonical correlation analysis, which was successfully tested in our previous works Deldjoo et al. (in: International Conference on Electronic Commerce and Web Technologies. Springer, Berlin, pp 34–45, 2016b; Proceedings of the Twelfth ACM Conference on Recommender Systems. ACM, 2018b), to better exploit complementary information between different modalities; (iii) proposing a two-step hybrid approach which trains a CF model on warm items (items with interactions) and leverages the learned model on the movie genome to recommend cold items (items without interactions). Experimental validation is carried out using a system-centric study on a large-scale, real-world movie recommendation dataset both in an absolute cold start and in a cold to warm transition; and a user-centric online experiment measuring different subjective aspects, such as satisfaction and diversity. Results show the benefits of this approach compared to existing approaches.

Audio-visual encoding of multimedia content for enhancing movie recommendations

Enhanced movie content similarity based on textual, auditory and visual information

Contrastive Intra- and Inter-Modality Generation for Enhancing Incomplete Multimedia Recommendation

Recommender Systems Leveraging Multimedia Content

Content-Based Movie Recommendation System: An Enhanced Approach to Personalized Movie Recommendations

Movie Recommendation System using Composite Ranking

Multi-source based movie recommendation with ratings and the side information

Using Affective Features from Media Content Metadata for Better Movie Recommendations

Videoader: a video advertising system based on intelligent analysis of visual content

Multimodal Movie Recommendation System Using Deep Learning

Moviescope: Large-scale Analysis of Movies using Multiple Modalities

Exploiting Visual Contents in Posters and Still Frames for Movie Recommendation

Online Video Recommendation Based on Multimodal Fusion and Relevance Feedback

Combining semantic and linguistic representations for media recommendation

Exploiting Rich Contents for Personalized Video Recommendation.

Movie genome: alleviating new item cold start in movie recommendation

Enhancing User Experience: Advanced Techniques in Movie Recommender Systems

Video-Music Retrieval:A Dual-Path Cross-Modal Network

Multimodal Content Representation and Similarity Ranking of Movies

Personalized Video Recommendation Using Rich Contents from Videos

Robust multi-objective visual bayesian personalized ranking for multimedia recommendation