DBLSTM-based Multi-Scale Fusion for Dynamic Emotion Prediction in Music.

Xinxing Li,Jiashen Tian,Mingxing Xu,Yishuang Ning,Lianhong Cai
DOI: https://doi.org/10.1109/icme.2016.7552956
2016-01-01
Abstract:Dynamic Music Emotion Prediction is crucial to the emerging applications of music retrieval and recommendation. Considering the influence of temporal context and hierarchical structure on emotion in music, we propose a Deep Bidirectional Long Short-Term Memory (DBLSTM) based multi-scale regression method. In this method, a post-processing component is utilised for individual DBSLTM output to further enhance the ability of temporal context processing and a fusion component is to integrate the output of all DBLSTM models with different scales. In addition, we investigate how the difference of sequence length between the training and predicting phase affects the performance of DBLSTM. We conduct our experiments on a public database of Emotion in Music task at MediaEval 2015, and the result shows that our method achieves significant improvement when compared with the state-of-art methods.
What problem does this paper attempt to address?