FAC: A Music Recommendation Model Based on Fusing Audio and Chord Features (115)

Weite Feng,Junrui Liu,Tong Li,Zhen Yang,Di Wu
DOI: https://doi.org/10.1142/s0218194022500577
IF: 1.007
2022-01-01
International Journal of Software Engineering and Knowledge Engineering
Abstract:Music content has recently been identified as useful information to promote the performance of music recommendations. Existing studies usually feed low-level audio features, such as the Mel-frequency cepstral coefficients, into deep learning models for music recommendations. However, such features cannot well characterize music audios, which often contain multiple sound sources. In this paper, we propose to model and fuse chord, melody, and rhythm features to meaningfully characterize the music so as to improve the music recommendation. Specially, we use two user-based attention mechanisms to differentiate the importance of different parts of audio features and chord features. In addition, a Long Short-Term Memory layer is used to capture the sequence characteristics. Those features are fused by a multilayer perceptron and then used to make recommendations. We conducted experiments with a subset of the last.fm-1b dataset. The experimental results show that our proposal outperforms the best baseline by [Formula: see text] on HR@10.
What problem does this paper attempt to address?