Speech Emotion Recognition Using Acoustic Features

JIANG Danning, CAI Lianhong
DOI: https://doi.org/10.3321/j.issn:1000-0054.2006.01.023
2006-01-01
Abstract: A speech emotion recognition algorithm was developed based on the statistical and temporal features of acoustic parameters for discriminating between emotions. The system first extracted basic prosodic and spectral parameters, then used a PNN (probabilistic neural network) to model the statistical features and an HMM (hidden Markov model) to model the temporal features. The sum and product rules were used to combine the probabilities from each group of features for the final decision. Experiments on a Chinese speech corpus showed that the statistical and temporal features tend to reflect different aspects of emotions. The accuracy obtained by feature combination was higher than that of either group alone, reaching a maximum of 92.9%.
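
The sum and product rules mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `combine`, the three-class setup, and the probability values are hypothetical, and equal class priors are assumed so that each rule reduces to a plain sum or product of the per-classifier probabilities.

```python
from math import prod


def combine(posteriors, rule="sum"):
    """Fuse per-classifier class probabilities (hypothetical helper).

    posteriors: one list of class probabilities per classifier,
                e.g. one from a PNN and one from an HMM.
    rule:       "sum" or "product".
    Returns (index of the winning class, normalized fused scores).
    """
    n_classes = len(posteriors[0])
    if rule == "sum":
        scores = [sum(p[c] for p in posteriors) for c in range(n_classes)]
    elif rule == "product":
        scores = [prod(p[c] for p in posteriors) for c in range(n_classes)]
    else:
        raise ValueError("rule must be 'sum' or 'product'")

    total = sum(scores)
    if total == 0:
        raise ValueError("all fused scores are zero")
    fused = [s / total for s in scores]  # renormalize so the scores sum to 1
    best = max(range(n_classes), key=lambda c: fused[c])
    return best, fused


# Made-up probabilities for three emotion classes (illustration only).
pnn_probs = [0.60, 0.40, 0.00]  # statistical-feature classifier
hmm_probs = [0.05, 0.15, 0.80]  # temporal-feature classifier

sum_best, sum_scores = combine([pnn_probs, hmm_probs], rule="sum")
prod_best, prod_scores = combine([pnn_probs, hmm_probs], rule="product")
print("sum rule picks class", sum_best)
print("product rule picks class", prod_best)
```

With these made-up inputs the two rules disagree. The sum rule favors the class that one classifier scores very highly, while the product rule rejects any class to which either classifier assigns zero probability.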