WeSeer: Visual Analysis for Better Information Cascade Prediction of WeChat Articles

Quan Li, Ziming Wu, Lingling Yi, Kristanto Sean N, Huamin Qu, Xiaojuan Ma
DOI: https://doi.org/10.48550/arXiv.1808.09068
2018-08-28
Abstract: Social media, such as Facebook and WeChat, empowers millions of users to create, consume, and disseminate online information on an unprecedented scale. The abundance of information on social media intensifies the competition among WeChat Public Official Articles (i.e., posts) for user attention, because attention is zero-sum. As a result, only a small portion of information becomes extremely popular while the rest remains unnoticed or quickly disappears. This 'long-tail' phenomenon is very common in social media. Recent years have therefore witnessed growing interest in predicting the future popularity of social media posts and in understanding the factors that influence it. Nevertheless, existing predictive models rely on either cumbersome feature engineering or sophisticated parameter tuning, both of which are difficult to understand and improve. In this paper, we study and enhance a point process-based model by incorporating visual reasoning to support communication between users and the predictive model for a better prediction result. The proposed system helps users uncover the working mechanism behind the model and improve prediction accuracy based on the insights gained. We use real WeChat articles to demonstrate the effectiveness of the system and verify the improved model on a large set of WeChat articles. We also elicit and summarize feedback from WeChat domain experts.
Human-Computer Interaction
What problem does this paper attempt to address?
This paper addresses the problem of predicting the final spread scale of articles on the WeChat platform, and in particular of predicting the popularity of different types of articles more accurately. It targets four shortcomings of existing approaches:

1. **Requirement for feature engineering**: Existing feature-based models require extensive feature extraction and parameter tuning, which is time-consuming and makes the models difficult to understand and improve.
2. **Limitations of binary classification**: Most feature-based models treat spread prediction as a classification problem and identify "viral" articles using pre-defined thresholds. This yields only a one-time prediction and no detailed analysis of the spread trend.
3. **Insufficient sensitivity**: Existing models usually set thresholds based on historical data in a fixed time window. This practice may fail to capture how the temporal characteristics of different types of articles change during the spread process.
4. **Difficulty in understanding the model mechanism**: The mechanism behind feature-engineered models is usually hard to interpret, which makes it difficult to improve prediction accuracy and to determine the influencing factors.

To overcome these problems, the paper proposes a method that combines the point process model SEISMIC with visual reasoning to support communication between experts and the prediction model, thereby improving both the accuracy and the interpretability of predictions. Specifically, the goals of the paper are to:

- **Study and enhance the point process model**: Use visual reasoning to improve the SEISMIC model so that it predicts the final spread scale of different types of WeChat articles more flexibly and reliably.
- **Design a visual reasoning system**: Support interaction between experts and the point process model, help experts understand the model's working mechanism, and improve prediction accuracy based on the insights obtained.
- **Verify the effectiveness of the method**: Use case studies and quantitative experiments to validate the proposed method and compare it with the original SEISMIC model, showing that the improved model performs better at predicting the spread scale of WeChat articles.

In this way, the paper aims to provide a more effective and flexible tool that helps the WeChat platform manage and recommend articles, and to offer a useful reference for academia and industry.
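
To make the contrast with threshold-based classification concrete, the following is a minimal sketch of how a self-exciting point process can turn the shares observed so far into a running estimate of final cascade size. It is a simplified illustration in the spirit of SEISMIC, not the paper's model: it assumes a constant infectiousness `p`, a power-law memory kernel, and no correction factors. The names `Share`, `kernel_cdf`, and `predict_final_size`, and the parameter values `s0`, `theta`, and `n_star`, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Share:
    time: float     # seconds since the article was posted
    followers: int  # audience size of the sharing user (assumed known)


def kernel_cdf(s, s0=300.0, theta=0.25):
    """Fraction of an audience's reactions expected within s seconds.

    Assumed memory kernel: constant up to s0, then power-law decay
    with exponent (1 + theta). s0 and theta are illustrative values.
    """
    if s <= 0:
        return 0.0
    c = theta / (s0 * (1.0 + theta))  # makes the kernel integrate to 1
    if s <= s0:
        return c * s
    return c * s0 + (c * s0 / theta) * (1.0 - (s / s0) ** (-theta))


def predict_final_size(shares, t, n_star):
    """Estimate the final number of reshares from shares observed up to t.

    shares[0] is taken to be the original post; n_star is an assumed
    effective audience size for future, not yet observed, sharers.
    """
    observed = [sh for sh in shares if sh.time <= t]
    reshares = max(len(observed) - 1, 0)
    n_total = sum(sh.followers for sh in observed)
    # Audience that has effectively already had its chance to react.
    n_exposed = sum(sh.followers * kernel_cdf(t - sh.time) for sh in observed)
    if reshares == 0 or n_exposed == 0:
        return float(reshares)
    p = reshares / n_exposed  # constant-infectiousness estimate
    if p * n_star >= 1.0:
        return float("inf")   # supercritical: the estimate diverges
    return reshares + p * (n_total - n_exposed) / (1.0 - p * n_star)


shares = [Share(0, 1000), Share(60, 200), Share(120, 50)]
estimate = predict_final_size(shares, t=600, n_star=20)
```

Unlike a fixed "viral or not" threshold, the estimate can be recomputed at any observation time `t` as new shares arrive, which is the property that makes the spread trend open to inspection.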