Emotional speaker recognition based on similar neighbor phenomenon

陈力,杨莹春
DOI: https://doi.org/10.3785/j.issn.1008-973X.2012.10.009
2012-01-01
Abstract:Based on the research on phonetics, the assumption that similar-sounding speakers in neutral condition also sound similar when they change their emotions was proposed, known as Similar Neighbor Phenomenon. Additionally, the qualitative and quantitative analysis was conducted to prove the assumption. The 'neighbors' of neutral and emotional model of the similar speaker are almost the same under the identical phonetic event. The emotional model synthesis method was proposed in order to overcome the problem that the distribution of acoustic feature under emotional states was different from that of the neutral speaker model. The method can learn the neutral-emotion transformation rules from the development corpus, and apply them into the evaluation corpus to construct the emotional speaker model from his/her neutral one. From the view of Similar Neighbor Phenomenon, neighbors under neutral were selected by the KL distance. The emotional models were constructed by the neighbors-based transformation method and shift-based transformation method. The experiments carried on MASC showed an identification rate (IR) increase of 2.81% over the GMM-UBM algorithm and 1.3% over the emotional attribute projection (EAP) algorithm.
What problem does this paper attempt to address?