Abstract:Big data generated from social media and smart mobile devices has been regarded as a key to obtain insights into human behavior and been extensively utilized for launching marketing activities. A successful marketing activity requires attracting high social popularity to their contents, since higher popularity usually indicates stronger influence, more fame and higher revenue. In this paper, we focus on the question of how to improve popularity of videos sharing on websites like YouTube in mobile computing environment. Obviously, composing high quality titles and tags is beneficial for viewers to discover videos of their interests and increase their tendency to watch more videos. However, it is not an easy task for uploaders, which is especially true since the screen is tight for most mobile devices. To this end, this paper proposes a novel hybrid method based on multi-modal content analysis that recommends keywords for video uploaders to compose titles and tags of their videos and then to gain higher popularity. The method generates candidate keywords by integrating techniques of textual semantic analysis of original tags and recognition of video content. On one hand, taking the original keywords of a video as input, the method obtains most relevant words from WordNet and related video titles gathered from the three top video sharing sites (YouTube, Yahoo Video, Bing Video). On the other hand, through recognizing video content with deep learning technology, the method extracts the entity name of video content as candidate keywords. Finally, a TF-SIM algorithm is proposed to rank the candidate keywords and the most relevant keywords are recommended to uploaders for optimizing the titles and tags of their videos. The experimental results show that the proposed method can effectively improve the social popularity of the videos as well as extend the length of video viewing time per playback.

Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding.

Video ChatBot

A Human-Machine Collaborative Video Summarization Framework Using Pupillary Response Signals

Video emotion analysis enhanced by recognizing emotion in video comments

Sentiment Analysis on Online Videos by Time-Sync Comments

VCMaster: Generating Diverse and Fluent Live Video Comments Based on Multimodal Contexts

Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting

Bridging Video Content And Comments: Synchronized Video Description With Temporal Summarization Of Crowdsourced Time-Sync Comments

ViCo: Engaging Video Comment Generation with Human Preference Rewards

LiveChat: Video Comment Generation from Audio-Visual Multimodal Contexts

Live Video Comment Generation Based on Surrounding Frames and Live Comments

Enhancing Multimodal Affective Analysis with Learned Live Comment Features

Emotional Video Captioning With Vision-Based Emotion Interpretation Network

An Intelligent Video Tag Recommendation Method for Improving Video Popularity in Mobile Computing Environment

Visual-Texual Emotion Analysis with Deep Coupled Video and Danmu Neural Networks

Video Highlights Detection and Summarization with Lag-Calibration based on Concept-Emotion Mapping of Crowd-sourced Time-Sync Comments

Neural Visual Social Comment on Image-Text Content

Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding.

Infer Induced Sentiment of Comment Response to Video: A New Task, Dataset and Baseline

SimTube: Generating Simulated Video Comments through Multimodal AI and User Personas

A Visual Approach to Tracking Emotional Sentiment Dynamics in Social Network Commentaries