Quantifying Controversy from Stance, Sentiment, Offensiveness and Sarcasm: a Fine-Grained Controversy Intensity Measurement Framework on a Chinese Dataset

Haiyang Wang,Ye Wang,Xin Song,Bin Zhou,Xuechen Zhao,Feng Xie
DOI: https://doi.org/10.1007/s11280-023-01191-x
2023-01-01
World Wide Web
Abstract:Controversy measurement on social media plays an important part in understanding public opinion. Various topics are frequently hotly debated on social media platforms including Twitter and Sina Weibo. People sometimes use offensive or sarcastic language to convey their opinions about a topic or a source post, which might spark heated discussions and controversy towards related topics. Recent researches take controversy detection as a binary classification problem with two labels: controversy or non-controversy. The reason might be lacking a comprehensive understanding of why the controversy happened and a specific imagination of how it will be used in the downstream tasks. However, we believe that the degree of controversy courted by posts or topics in a real scenario varied. And fine-grained measurement of controversy will be beneficial to public sentiment identification, influence assessment and other social network analysis tasks. We also notice that the existing benchmarks of controversy detection are not applicable for fine-grained topic-level controversy measurement. In this paper, we present ProsCons , a large-scale comprehensive Chinese dataset that includes 245 topics and 32,667 posts with pro , con or neutral stances. Based on that, we design a controversy measurement framework for measuring the controversy intensity that topics sparked. This framework considers the degree of antagonism in terms of stance and sentiment, as well as the irrational degree (offensive or sarcasm) of a post to compute a controversy intensity. ProsCons provides a new benchmark for Chinese stance detection, offensive language and sarcasm detection, contributing to the multi-task learning of them. We conduct extensive experiments on ProsCons and provide baselines for these tasks. The experimental results highlight the challenges of the aforementioned tasks based on the ProsCons.
What problem does this paper attempt to address?