Abstract:Emotional design is an important development trend of interaction design. Emotional design in products plays a key role in enhancing user experience and inducing user emotional resonance. In recent years, based on the user's emotional experience, the design concept of strengthening product emotional design has become a new direction for most designers to improve their design thinking. In the emotional interaction design, the machine needs to capture the user's key information in real time, recognize the user's emotional state, and use a variety of clues to finally determine the appropriate user model. Based on this background, this research uses a deep learning mechanism for more accurate and effective emotion recognition, thereby optimizing the design of the interactive system and improving the user experience. First of all, this research discusses how to use user characteristics such as speech, facial expression, video, heartbeat, etc., to make machines more accurately recognize human emotions. Through the analysis of various characteristics, the speech is selected as the experimental material. Second, a speech-based emotion recognition method is proposed. The mel-Frequency cepstral coefficient (MFCC) of the speech signal is used as the input of the improved long and short-term memory network (ILSTM). To ensure the integrity of the information and the accuracy of the output at the next moment, ILSTM makes peephole connections in the forget gate and input gate of LSTM, and adds the unit state as input data to the threshold layer. The emotional features obtained by ILSTM are input into the attention layer, and the self-attention mechanism is used to calculate the weight of each frame of speech signal. The speech features with higher weights are used to distinguish different emotions and complete the emotion recognition of the speech signal. Experiments on the EMO-DB and CASIA datasets verify the effectiveness of the model for emotion recognition. Finally, the feasibility of emotional interaction system design is discussed.

Speech Emotion Recognition in Dyadic Dialogues with Attentive Interaction Modeling

Deep Spectrum Feature Representations for Speech Emotion Recognition

Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition

Self-attention Transfer Networks for Speech Emotion Recognition

MFDR: Multiple-stage Fusion and Dynamically Refined Network for Multimodal Emotion Recognition

Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition

A Contextual Attention Network for Multimodal Emotion Recognition in Conversation

Emotion Recognition via Environmental Context and Human Body

Conversational Emotion Analysis via Attention Mechanisms

Bridging the Emotional Semantic Gap via Multimodal Relevance Estimation

Emotion embedding framework with emotional self-attention mechanism for speaker recognition

AIMDiT: Modality Augmentation and Interaction via Multimodal Dimension Transformation for Emotion Recognition in Conversations

Speaker-aware cognitive network with cross-modal attention for multimodal emotion recognition in conversation

Language-guided Multi-modal Emotional Mimicry Intensity Estimation

Emotion Recognition Model Based on Multimodal Decision Fusion

Emotion Recognition for Multiple Context Awareness.

Bridging Discrete and Continuous: A Multimodal Strategy for Complex Emotion Detection

A Novel User Emotional Interaction Design Model Using Long and Short-Term Memory Networks and Deep Learning

Contextual and Cross-Modal Interaction for Multi-Modal Speech Emotion Recognition

Emotional Cues Extraction and Fusion for Multi-modal Emotion Prediction and Recognition in Conversation

Multi-Modal Attentive Prompt Learning for Few-shot Emotion Recognition in Conversations