Understanding the Users and Videos by Mining a Novel Danmu Dataset

Guangyi Lv,Kun Zhang,Le Wu,Enhong Chen,Tong Xu,Qi Liu,Weidong He
DOI: https://doi.org/10.1109/tbdata.2019.2950411
2022-01-01
IEEE Transactions on Big Data
Abstract:Recent years have witnessed a successful rise of the time synchronized gossiping comment, or so-called danmu combined with online videos. This new business mode has enriched communication among users by sending users’ feelings through danmus and sharing these danmus on time synchronized videos. Can danmu communication be helpful for better user behavior modeling or video analyzing? To this question, in this article, preliminary attempts are made on analysis of users and videos by introducing a Danmu dataset which is collected from a real-world danmu-enabled video sharing platform. The dataset contains 1.7 TB of videos and danmus in total across eight video categories. With a focus on the 7.9 million danmus records and 4.8 million video frames, we first perform the basic statistic analysis and high-level semantic analysis. After that, we show some of the previous work on this area, including user behavior modeling, fine-grained video understanding and labeling, video plot generation and image-enhanced semantic understanding. For each application, we also propose its possible future directions. We hope this new dataset will inspire new ideas in areas among language, multimedia, and user understanding.
What problem does this paper attempt to address?