Toward emotional speaker recognition: framework and preliminary results

Yingchun Yang,Li Chen
DOI: https://doi.org/10.1007/978-3-642-35136-5_29
2012-01-01
Abstract:Besides channel and environment noises, emotion variability in speech signals has been found to be another important factor that degenerates drastically the performance of most speaker recognition systems proposed in the literature. How to make current GMM-UBM system adaptive to emotion variability is one consideration. We thus propose a framework named Deformation Compensation (DC) for emotional speaker recognition, which viewing emotion variability as deformation (some sort of distribution distortion in the feature space) and trying to take deformation compensation by making dynamic modification on the feature, model and score level. This paper reports the preliminary results which have been gained so far, including our proposed Deformation Compensation framework together with the preliminary case study on GMM-UBM.
What problem does this paper attempt to address?