The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems

Adaeze Adigwe,Noé Tits,Kevin El Haddad,Sarah Ostadabbas,Thierry Dutoit
DOI: https://doi.org/10.48550/arXiv.1806.09514
2018-06-25
Abstract:In this paper, we present a database of emotional speech intended to be open-sourced and used for synthesis and generation purpose. It contains data for male and female actors in English and a male actor in French. The database covers 5 emotion classes so it could be suitable to build synthesis and voice transformation systems with the potential to control the emotional dimension in a continuous way. We show the data's efficiency by building a simple MLP system converting neutral to angry speech style and evaluate it via a CMOS perception test. Even though the system is a very simple one, the test show the efficiency of the data which is promising for future work.
Computation and Language,Artificial Intelligence,Audio and Speech Processing
What problem does this paper attempt to address?