Gossiping the Videos: An Embedding-Based Generative Adversarial Framework for Time-Sync Comments Generation

Guangyi Lv, Tong Xu, Qi Liu, Enhong Chen, Weidong He, Mingxiao An, Zhongming Chen
DOI: https://doi.org/10.1007/978-3-030-16142-2_32
2019-01-01
Abstract: Recent years have witnessed the rise of time-sync “gossiping comments”, the so-called “Danmu”, displayed alongside online videos. Along this line, automatically generating Danmus may attract users by offering better interaction. However, this task is extremely challenging because of informal expressions and the “semantic gap” between text and videos: Danmus are usually not straightforward descriptions of the videos, but subjective and diverse expressions. To that end, in this paper we propose a novel Embedding-based Generative Adversarial (E-GA) framework to generate time-sync video comments with “gossiping” behavior. Specifically, we first model the informal styles of comments via a semantic embedding inspired by variational autoencoders (VAE), and then generate Danmus in a generative adversarial manner to deal with the gap between visual and textual content. Extensive experiments on a large-scale real-world dataset demonstrate the effectiveness of our E-GA framework.