BTM: Topic Modeling over Short Texts

Xueqi Cheng, Xiaohui Yan, Yanyan Lan, Jiafeng Guo
DOI: https://doi.org/10.1109/TKDE.2014.2313872
IF: 9.235
2014-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Short texts are popular on today's web, especially with the emergence of social media. Inferring topics from large scale short texts becomes a critical but challenging task for many content analysis tasks. Conventional topic models such as latent Dirichlet allocation (LDA) and probabilistic latent semantic analysis (PLSA) learn topics from document-level word co-occurrences by modeling each docume...