Acoustics, Content and Geo-Information Based Sentiment Prediction from Large-Scale Networked Voice Data

Zhu Ren,Jia,Quan Guo,Kuo Zhang,Lianhong Cai
DOI: https://doi.org/10.1109/icme.2014.6890151
2014-01-01
Abstract:Sentiment analysis from large-scale networked data attracts increasing attention in recent years. Most previous works on sentiment prediction mainly focus on text or image data. However, voice is the most natural and direct way to express people's sentiments in real-time. With the rapid development of smart phone voice dialogue applications (e.g., Siri and Sogou Voice Assistant), the large-scale networked voice data can help us better quantitatively understand the sentimental world we live in. In this paper, we study the problem of sentiment prediction from large-scale networked voice data. In particular, we first investigate the data observations and underlying sentiment patterns in human-mobile voice communication. Then we propose a deep sparse neural network (DSNN) model to incorporate acoustic features, content information and geo-information to automatically predict sentiments. The effectiveness of the proposed model is verified by the experiments on a real dataset from Sogou Voice Assistant application.
What problem does this paper attempt to address?