Abstract:Big data generated from social media and smart mobile devices has been regarded as a key to obtain insights into human behavior and been extensively utilized for launching marketing activities. A successful marketing activity requires attracting high social popularity to their contents, since higher popularity usually indicates stronger influence, more fame and higher revenue. In this paper, we focus on the question of how to improve popularity of videos sharing on websites like YouTube in mobile computing environment. Obviously, composing high quality titles and tags is beneficial for viewers to discover videos of their interests and increase their tendency to watch more videos. However, it is not an easy task for uploaders, which is especially true since the screen is tight for most mobile devices. To this end, this paper proposes a novel hybrid method based on multi-modal content analysis that recommends keywords for video uploaders to compose titles and tags of their videos and then to gain higher popularity. The method generates candidate keywords by integrating techniques of textual semantic analysis of original tags and recognition of video content. On one hand, taking the original keywords of a video as input, the method obtains most relevant words from WordNet and related video titles gathered from the three top video sharing sites (YouTube, Yahoo Video, Bing Video). On the other hand, through recognizing video content with deep learning technology, the method extracts the entity name of video content as candidate keywords. Finally, a TF-SIM algorithm is proposed to rank the candidate keywords and the most relevant keywords are recommended to uploaders for optimizing the titles and tags of their videos. The experimental results show that the proposed method can effectively improve the social popularity of the videos as well as extend the length of video viewing time per playback.

Video Ads Content Structuring by Combining Scene Confidence Prediction and Tagging

A Multimodal Framework for Video Ads Understanding

Videoader: a video advertising system based on intelligent analysis of visual content

Multi-modal Representation Learning for Video Advertisement Content Structuring

Smart Advertising in Videos Based on Comprehensive Content Analytics

Predicting Content Similarity Via Multimodal Modeling for Video-In-Video Advertising.

Story Understanding in Video Advertisements

Multimodal Content Analysis for Effective Advertisements on YouTube

Segmentation, Categorization, and Identification of Commercial Clips from TV Streams Using Multimodal Analysis

MM-AU:Towards Multimodal Understanding of Advertisement Videos

Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning

Automatic Understanding of Image and Video Advertisements

Ad-Net: Audio-Visual Convolutional Neural Network for Advertisement Detection In Videos

ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising

An Intelligent Video Tag Recommendation Method for Improving Video Popularity in Mobile Computing Environment

Multi-layer multi-view topic model for classifying advertising video.

Mining Adjacent Markets from a Large-Scale Ads Video Collection for Image Advertising

The CASE Dataset of Candidate Spaces for Advert Implantation

Tree-based Text-Vision BERT for Video Search in Baidu Video Advertising

Decoding viewer emotions in video ads

ADNet: A Deep Network for Detecting Adverts