Microblog Topic Mining Based on Hashtag

Jing LI, Jian YIN, Shaopeng LIU, Yali PAN
DOI: https://doi.org/10.3969/j.issn.1000-3428.2015.04.006
2015-01-01
Abstract: With the development of the Internet, microblogs have become a major platform for obtaining information. To mine useful topics from microblogs, this paper uses three features of microblog posts (conversation tags, retweet tags, and hashtags) to divide them into three kinds: posts about users' interests, posts about user interaction, and hashtag-related posts. It proposes a novel hashtag topic model, the Hashtag Conversation Author Topic Model (HC-ATM), based on the Author Topic Model (ATM), and uses Gibbs sampling for inference. Experiments on a Twitter dataset show that HC-ATM outperforms both ATM and MicroBlog Latent Dirichlet Allocation (MB-LDA) in terms of perplexity and KL-divergence. In addition, HC-ATM can effectively mine the topic distributions of the different kinds of microblog posts.
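The three-way division of posts described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the regular expressions, the function name classify_post, the mapping of conversation and retweet tags to the "user interaction" kind, and the rule that a hashtag takes precedence over other tags are all assumptions made for illustration, since the abstract does not state how posts carrying several tags are handled.

```python
import re

# Hypothetical marker patterns; the abstract does not specify how tags are detected.
RETWEET_TAG = re.compile(r"(?:^|\s)RT\s+@\w+")   # retweet tag, e.g. "RT @bob"
CONVERSATION_TAG = re.compile(r"(?:^|\s)@\w+")   # conversation tag, e.g. "@alice"
HASHTAG = re.compile(r"(?:^|\s)#\w+")            # hashtag, e.g. "#music"


def classify_post(text: str) -> str:
    """Assign a post to one of the three kinds named in the abstract.

    Assumption: a hashtag takes precedence over conversation and retweet
    tags. The paper may resolve such overlaps differently.
    """
    if HASHTAG.search(text):
        return "hashtag-related"
    if RETWEET_TAG.search(text) or CONVERSATION_TAG.search(text):
        return "user interaction"
    # No tag of any kind: treat the post as reflecting the user's own interest.
    return "user interest"


if __name__ == "__main__":
    for post in [
        "RT @bob: nice talk",
        "Loving the new album #music",
        "Coffee first, then code",
    ]:
        print(f"{classify_post(post):17s} {post}")
```

In a pipeline of this shape, each kind of post would then be handled by the corresponding part of the topic model; the inference step itself (Gibbs sampling in HC-ATM) is not sketched here because the abstract gives no detail on it.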