Abstract: Twitter is one of the biggest platforms, with a massive number of instant messages (i.e. tweets) published every day. Users tend to express their real feelings freely on Twitter, which makes it an ideal source for capturing opinions on topics of interest such as brands, products, or celebrities. Naturally, people may want a way to obtain the overall sentiment tendency towards these topics directly, rather than by reading the huge number of tweets about them. On the other hand, hashtags, which prefix keywords or phrases with the symbol "#", are widely used in tweets as coarse-grained topics. In this paper, instead of presenting the sentiment polarity of each tweet relevant to a topic, we focus our study on hashtag-level sentiment classification. This task aims to automatically generate the overall sentiment polarity for a given hashtag in a certain time period, which differs markedly from conventional sentence-level and document-level sentiment analysis. Our investigation shows that three types of information are useful for addressing the task: (1) the sentiment polarity of tweets containing the hashtag; (2) hashtag co-occurrence relationships; and (3) the literal meaning of hashtags. Consequently, in order to incorporate the first two types of information into a classification framework where hashtags can be classified collectively, we propose a novel graph model and investigate three approximate collective classification algorithms for inference. Going one step further, we show that performance can be remarkably improved using an enhanced boosting classification setting in which we employ the literal meaning of hashtags as semi-supervised information. Experimental results on a real-life data set consisting of 29,195 tweets and 2,181 hashtags show the effectiveness of the proposed model and algorithms.
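To make the first two information types concrete, the following is a minimal Python sketch of how tweet-level polarity and hashtag co-occurrence might be combined. It is not the graph model or any of the three inference algorithms proposed in the paper, which the abstract does not specify. It uses a simple iterative neighbor-voting scheme instead, and the function name `hashtag_sentiment`, the parameters `rounds` and `neighbor_weight`, and the toy data are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def hashtag_sentiment(tweets, rounds=5, neighbor_weight=0.5):
    """Assign +1/-1 to each hashtag (illustrative only, not the paper's model).

    tweets: list of (set_of_hashtags, polarity) pairs, polarity in {+1, -1}.
    """
    polarity_sum = defaultdict(float)  # summed tweet polarity per hashtag
    tweet_count = Counter()            # number of tweets per hashtag
    cooc = defaultdict(Counter)        # hashtag co-occurrence counts

    for tags, polarity in tweets:
        for tag in tags:
            polarity_sum[tag] += polarity
            tweet_count[tag] += 1
        for a, b in combinations(sorted(tags), 2):
            cooc[a][b] += 1
            cooc[b][a] += 1

    # Information type (1): mean polarity of the tweets containing the hashtag.
    local = {t: polarity_sum[t] / tweet_count[t] for t in tweet_count}
    labels = {t: (1 if s >= 0 else -1) for t, s in local.items()}

    # Information type (2): repeatedly mix in the labels of co-occurring hashtags.
    for _ in range(rounds):
        updated = {}
        for tag in labels:
            total = sum(cooc[tag].values())
            if total:
                neigh = sum(labels[n] * c for n, c in cooc[tag].items()) / total
            else:
                neigh = 0.0
            score = local[tag] + neighbor_weight * neigh
            updated[tag] = 1 if score >= 0 else -1
        if updated == labels:  # stop once labels no longer change
            break
        labels = updated
    return labels


# Toy data (made up for illustration).
toy_tweets = [
    ({"#a", "#b"}, 1),
    ({"#b", "#c"}, 1),
    ({"#c"}, -1),
    ({"#c"}, -1),
    ({"#d"}, -1),
]
result = hashtag_sentiment(toy_tweets)
```

In this toy data, `#c` is slightly negative on its own tweets, but its co-occurrence with the positive `#b` can shift its label; setting `neighbor_weight=0` reduces the sketch to per-hashtag voting over tweet polarities.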

SenTopX: Benchmark for User Sentiment on Various Topics

Exploring the Distinctive Tweeting Patterns of Toxic Twitter Users

SBTM: A Joint Sentiment and Behaviour Topic Model for Online Course Discussion Forums

A deep dive into the consistently toxic 1% of Twitter

Investigating the Relationship Between User Specialization and Toxicity on Reddit: A Sentiment Analysis Approach

Understanding Longitudinal Behaviors of Toxic Accounts on Reddit

Analyzing Social Media Sentiment: Twitter as a Case Study

Impact of Sentiment Detection to Recognize Toxic and Subversive Online Comments

Twitter Sentiment Analysis Using Textual Information and Diffusion Patterns

Analyzing Public Perceptions and User Sentiments on Tweets: A Machine Learning Approach

Topic Sentiment Analysis in Twitter

Social Media Sentiment Analysis Using Twitter Dataset

TwiInsight: Discovering Topics and Sentiments from Social Media Datasets

Twits, Toxic Tweets, and Tribal Tendencies: Trends in Politically Polarized Posts on Twitter

ALONE: A Dataset for Toxic Behavior among Adolescents on Twitter

Sadness, Anger, or Anxiety: Twitter Users' Emotional Responses to Toxicity in Public Conversations

COVID-19 Twitter Dataset with Latent Topics, Sentiments and Emotions Attributes

Analyzing Toxicity in Deep Conversations: A Reddit Case Study

Tracking Patterns in Toxicity and Antisocial Behavior Over User Lifetimes on Large Social Media Platforms

Implementing Sentiment Analysis on Real-Time Twitter Data

User-sentiment topic model: refining user's topics with sentiment information