Music emotion prediction based on multidimensional feature fusion

Xuanye Zhou,Zhibin Su,Hui Ren
DOI: https://doi.org/10.1109/ICIBA56860.2023.10165361
2023-01-01
Abstract:Audience sentiment analysis is a popular research task in Music Information Retrieval (MIR). In this work, we aim to lean the prediction model of fine-grained music emotion for intelligent editing and retrieval. To satisfy the properties of time-series of music, the RNN-based model of BILSTM was selected as the bone. In particular, the traditional features and deep features extracted from the CNN architecture of spectrogram was fused with attention mechanism to enhance the recognition ability for different emotions. The ablation experiment has proved the necessary of the multi-dimensional features. Finally, the contrasting experiments on recent algorithms demonstrate that our proposed method for emotion prediction is effective, which is also suitable for small data sets.
What problem does this paper attempt to address?