Selecting Social Media Responses to News: A Convex Framework Based on Data Reconstruction.
Zaiyi Chen, Linli Xu, Enhong Chen, Biao Chang, Zhefeng Wang, Yitan Li
DOI: https://doi.org/10.1137/1.9781611974010.61
2015-01-01
Published in: Proceedings of the 2015 SIAM International Conference on Data Mining (SDM), pp. 541-549
Abstract: With its explosive growth, social media has gained significantly increasing attention from both journalists and their readership in recent years, enhancing the reading experience through its timeliness, high participation, and interactivity. On the other hand, the popularity of social media services such as Twitter also leads to information overload: each hot news article generates thousands of responses (tweets), which is overwhelming for readers. In this paper, we address the problem of selecting a representative subset of responses to news in order to deliver the most important information. We consider different criteria regarding the importance of the selected subset, and treat the problem from the data reconstruction perspective with concerns for both the quality and the generalizability of the selection. The intuition behind our work is that a good selection should be relevant at two levels: i) at the message level, it brings readers as much new information as possible or comprehensively generalizes other people's opinions; ii) at the text level, it is able to reconstruct the corpus. Specifically, the task of selecting responses to news can be formulated as a convex optimization problem in which sparse non-negative weights are introduced for all the responses, indicating whether they are selected or not.
Several gradient-based optimization and step size selection methods are also investigated in this paper to achieve a faster rate of convergence. More importantly, the proposed framework evaluates the utility of a set of responses jointly and therefore is able to reduce redundancy of the selected responses. We evaluate our approach on real-world data obtained from Twitter, and the results demonstrate superior performance over the state of the art in both accuracy and generalizability.
Published: 2015. eISBN: 978-1-61197-401-0. Book DOI: https://doi.org/10.1137/1.9781611974010. Book Series Name: Proceedings. Book Code: PRDT145. Book Pages: 1-976.
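The abstract describes selection as a convex problem over sparse non-negative weights, solved with gradient-based methods, but does not state the objective itself. The sketch below is therefore only an illustration under assumptions of our own, not the paper's formulation: it takes a non-negative lasso objective, 0.5 * ||target - X^T w||^2 + lam * sum(w) with w >= 0, where each row of `X` is a hypothetical feature vector for one response and `target` is a hypothetical corpus-level vector to reconstruct. It uses plain projected gradient descent with a fixed step size; the function name `select_responses` and the parameters `lam` and `iters` are likewise made up for this example.

```python
def select_responses(X, target, lam=0.1, iters=500):
    """Projected gradient descent for an ASSUMED objective (not from the paper):
        min_w 0.5 * ||target - X^T w||^2 + lam * sum(w)   subject to w >= 0.
    X is a list of n feature vectors (one per response); target has length d.
    Returns the list of n non-negative weights.
    """
    n, d = len(X), len(target)
    # The squared Frobenius norm of X upper-bounds the Lipschitz constant of
    # the gradient, so 1 / that value is a safe (if conservative) fixed step.
    lipschitz = sum(v * v for row in X for v in row) or 1.0
    step = 1.0 / lipschitz
    w = [0.0] * n
    for _ in range(iters):
        # residual = X^T w - target
        residual = [sum(w[i] * X[i][j] for i in range(n)) - target[j]
                    for j in range(d)]
        # gradient_i = <X[i], residual> + lam (lam comes from the sparsity term)
        grad = [sum(X[i][j] * residual[j] for j in range(d)) + lam
                for i in range(n)]
        # Gradient step, then projection onto the non-negative orthant.
        w = [max(0.0, w[i] - step * grad[i]) for i in range(n)]
    return w


# Toy data: three responses with one-hot features; the corpus-level target
# has no mass on the third feature.
X = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.0, 1.0]]
target = [1.0, 1.0, 0.0]
weights = select_responses(X, target)
selected = [i for i, wi in enumerate(weights) if wi > 1e-8]
```

On this toy input the third response contributes nothing to reconstructing the target, so its weight stays at zero and only the first two responses are selected. Because all weights are optimized jointly against one reconstruction error, the value of a response depends on what the others already cover, which is the kind of joint evaluation the abstract refers to.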