Deep Collaborative Filtering Incorporating Auxiliary Multi-Media Information.

Shuang Li,Yanghui Yan,Chao Wu,Kaichuan Zhao,Yuezhi Zhou,Yaoxue Zhang
DOI: https://doi.org/10.1109/smartworld.2018.00164
2018-01-01
Abstract:Collaborative Filtering (CF) has been widely used in recommendation due to its brevity and effectiveness. However, rating sparsity or even cold-start is still an inevitable problem when facing CF. Although multimodal information can be easily found in today's website, most existing CF methods are not well designed to incorporate them appropriately. In this paper, we intend to explore the potential methods of fusing abundant auxiliary cross-media information of user and items to help enhance traditional CF. In general, we divide the auxiliary data into discrete partial features (tags, genres ...) and continuous global features (images, videos ...) for different purposes. As for discrete features, embedding is adopted to represent them. In order to merge different discrete components, we use attention mechanism to adaptively learn their respective importance in contributing to a specific rating prediction. While for the continuous features, we use feature extraction network such as CNN to exploit informative representation to help restrain item's embedding space, consequently the interpretability of factorized vectors is improved. To validate the performance of our proposed method, extensive evaluations on public MovieLens dataset are carried. Experimental results on MSE improvements demonstrate both the importance of the auxiliary information and effectiveness of the proposed approach.
What problem does this paper attempt to address?