Abstract:With the continuous development of the research in the field of emotion analysis, music, as a common multimodal information carrier in people's daily life, often transmits emotion through lyrics and melody, so it has been gradually incorporated into the research category of emotion analysis. The fusion classification model based on CNN-LSTM proposed in this paper effectively improves the accuracy of emotional classification of audio and lyrics. At the same time, in view of the problem that the traditional decision-level fusion method ignores the correlation between modes and the limitations of dataset, this paper further improves the existing Thayer dimension emotional decision fusion method, takes the audio energy axis data as the main discrimination basis, and improves the accuracy of decision fusion classification. Based on the results of music emotion analysis, this paper further carries out the task of music generation. Based on the feature that there is often consistent emotional expression between music words and songs, a dual Seq2Seq framework based on reinforcement learning is constructed. By introducing the reward value of emotional consistency and content fidelity, the output melody has the same emotion with the input lyrics and good results are achieved. Compared with the ordinary Seq2Seq, the accuracy of our proposed model is improved by about 1.1%. This shows that the accuracy of the model can be effectively improved by using reinforcement learning.

A LDA-based approach to lyric emotion regression

Modelling Emotion Dynamics in Song Lyrics with State Space Models

Lyric-based Song Emotion Detection with Affective Lexicon and Fuzzy Clustering Method.

Deep learning based mood tagging for Chinese song lyrics

Enhance Popular Music Emotion Regression by Importing Structure Information

A Deep Bidirectional Long Short-Term Memory Based Multi-Scale Approach for Music Dynamic Emotion Prediction

Research on Music Emotional Expression Based on Reinforcement Learning and Multimodal Information

AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models

Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction

The Contribution of Lyrics and Acoustics to Collaborative Understanding of Mood

Emotion-Conditioned Melody Harmonization with Hierarchical Variational Autoencoder

Song Emotion Classification of Lyrics with Out-of-Domain Data under Label Scarcity

ReLyMe: Improving Lyric-to-Melody Generation by Incorporating Lyric-Melody Relationships

Exploiting Synchronized Lyrics And Vocal Features For Music Emotion Detection

Emotion Analysis of Songs Based on Lyrical and Audio Features

Transformer-based approach towards music emotion recognition from lyrics

Exploration of Music Emotion Recognition Based on MIDI.

Joint sentiment analysis of lyrics and audio in music

Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM

Towards Emotion-Based Synthetic Consciousness: Using LLMs to Estimate Emotion Probability Vectors

Fine-Grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis