Non-Verbal Voice Emotion Analysis System

shunji mitsuyoshi,fuji ren,yasuto tanaka,shingo kuroiwa
2006-01-01
Abstract:A non-verbal voice analysis system that recognizes, separates and ranks concurrent emotions in real time has potential application in various fields, yet such a system that could delineate emotion based solely on the sound of a human voice has not been successfully demonstrated before. Here, we propose a system that recognizes human emotion by means of analyzing the fundamental frequency of a human voice taken from continuous natural speech. The system detects robust fundamental frequencies and intonations by parameterizing them into pitch, power, and deviation of power. Based on these parameters, data was classified via decision-tree logic into the emotional elements of anger, joy, sorrow, and calmness. Degree of excitement was also extracted. The system was evaluated by third parties by matching the system performance to human subjective classification for each element. Results indicate that overall matching rate was 70%, and the matching rate was 86% when compared to the subjects' assessment of their own voices. Our system performance exceeded the baseline with non-verbal information, which was equivalent to human subjective assessment.This is the first report of a system to rigorously analyze human emotion in real time and we expect numerous applications.
What problem does this paper attempt to address?