Abstract: Feature selection is a crucial step in developing a system for recognizing emotions in speech. The interaction between features extracted from the same audio source has rarely been considered in recent work, which can leave redundant features in place and increase computational cost. To address this problem, a feature selection method based on correlation analysis and the Fisher criterion is proposed, which removes redundant features that are closely correlated with each other. To further improve the recognition performance of the feature subset obtained by the proposed feature selection, an emotion recognition method based on an extreme learning machine (ELM) decision tree is proposed, structured according to the degree of confusion among different basic emotions. A framework for speech emotion recognition is presented, and classification experiments using the proposed method are performed on the Chinese speech database from the Institute of Automation of the Chinese Academy of Sciences (CASIA). The experimental results show that the proposed method achieves an average recognition rate of 89.6%. The proposed method could make it fast and efficient to discriminate the emotional states of different speakers from speech, and could make speaker-independent interaction between humans and computers/robots possible in the future.
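The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one way correlation analysis and a Fisher-style score might be combined. It assumes a per-feature Fisher score (between-class scatter over within-class scatter), a greedy pass in descending score order, and an absolute Pearson correlation cutoff of 0.9. The function names `fisher_scores` and `select_features`, the greedy ordering, and the threshold are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def fisher_scores(X, y):
    """Per-feature Fisher score: between-class scatter / within-class scatter."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for label in np.unique(y):
        Xc = X[y == label]
        between += len(Xc) * (Xc.mean(axis=0) - overall_mean) ** 2
        within += len(Xc) * Xc.var(axis=0)
    # Small constant guards against division by zero for degenerate features.
    return between / (within + 1e-12)


def select_features(X, y, corr_threshold=0.9):
    """Greedily keep high-scoring features that are not strongly correlated
    with any feature already kept.

    corr_threshold is an assumed cutoff, not a value from the paper.
    Assumes no constant columns (their correlation would be undefined).
    """
    X = np.asarray(X, dtype=float)
    scores = fisher_scores(X, y)
    corr = np.abs(np.corrcoef(X, rowvar=False))
    kept = []
    for j in np.argsort(-scores):  # highest Fisher score first
        if all(corr[j, k] < corr_threshold for k in kept):
            kept.append(int(j))
    return kept
```

Calling `select_features(X, y)` on a feature matrix `X` of shape (samples, features) with emotion labels `y` returns the indices of the retained columns; among a group of near-duplicate features, only the one with the highest score survives.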

Improving Speaker Recognition by Training on Emotion-Added Models

Emotion-State conversion for speaker recognition

Emotional Speech Clustering Based Robust Speaker Recognition System

Emotional Speaker Identification By Humans And Machines

Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition

Applying difference detection and pruning to emotional speaker recognition

Cost-Sensitive Learning for Emotion Robust Speaker Recognition

Emotional speaker recognition based on similar neighbor phenomenon

Pitch envelope based frame level score reweighed algorithm for emotion robust speaker recognition

Emotional Speaker Recognition Based on Model Space Migration through Translated Learning

Simplified Deformation Compensation for Emotional Speaker Recognition

Scores Selection for Emotional Speaker Recognition

Natural-Emotion Gmm Transformation Algorithm For Emotional Speaker Recognition

Applying Emotional Factor Analysis And I-Vector To Emotional Speaker Recognition

Toward emotional speaker recognition: framework and preliminary results

A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition

Speech Emotion Recognition Based on Feature Selection and Extreme Learning Machine Decision Tree

Emotion-Detecting Based Model Selection For Emotional Speech Recognition

Affect-insensitive Speaker Recognition Systems Via Emotional Speech Clustering Using Prosodic Features

Speech Emotion Recognition Based on Linear Discriminant Analysis and Support Vector Machine Decision Tree

Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM