Machines Imitating Humans

Qazi S. M. Zia-ul-Haque, Zhiliang Wang, Xueyuan Zhang
DOI: https://doi.org/10.1007/978-1-4020-8919-0_19
2009-01-01
Abstract: The authors synthesize emotion in the speech of a robot. Modeling emotion in speech relies on a number of parameters, among them fundamental frequency (F0) level, voice quality, and articulation precision. As initial work toward synthesizing emotion in speech, they used the three voice features provided by the TTS engine of the Microsoft Speech SDK: pitch, rate, and volume. Speech with these parameters controlled was generated randomly, with 20 sentences for each emotion, and the perceptions of human listeners were collected.
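The procedure described in the abstract (sample pitch, rate, and volume settings, then hand marked-up sentences to the TTS engine) might be sketched as follows. This is a minimal illustration, not the authors' implementation: the emotion names and numeric ranges are hypothetical placeholders, since the abstract gives none, and the helper names `sample_params`, `to_sapi_xml`, and `make_stimuli` are invented here. The markup uses the SAPI 5 XML tags `volume`, `rate`, and `pitch`; actually speaking the strings would require a Windows SAPI voice, which this sketch does not call.

```python
import random
from xml.sax.saxutils import escape

# Hypothetical parameter ranges per emotion; NOT values from the paper.
# In SAPI 5 XML, pitch and rate adjustments span -10..10 and volume 0..100.
EMOTION_RANGES = {
    "happy": {"pitch": (2, 6), "rate": (1, 4), "volume": (80, 100)},
    "sad": {"pitch": (-6, -2), "rate": (-4, -1), "volume": (40, 70)},
}


def sample_params(emotion, rng):
    """Draw one random pitch/rate/volume setting inside the emotion's ranges."""
    ranges = EMOTION_RANGES[emotion]
    return {name: rng.randint(lo, hi) for name, (lo, hi) in ranges.items()}


def to_sapi_xml(text, pitch, rate, volume):
    """Wrap a sentence in SAPI 5 XML tags that set volume, rate, and pitch."""
    return (
        f'<volume level="{volume}"><rate absspeed="{rate}">'
        f'<pitch absmiddle="{pitch}">{escape(text)}</pitch></rate></volume>'
    )


def make_stimuli(emotion, sentences, seed=0):
    """Build one randomly parameterized markup string per sentence."""
    rng = random.Random(seed)
    return [to_sapi_xml(s, **sample_params(emotion, rng)) for s in sentences]
```

Calling `make_stimuli("sad", sentences)` with a list of 20 sentences would give 20 marked-up strings for one emotion, which could then be synthesized and played to listeners for rating.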