Abstract: Video summarization facilitates rapid browsing and efficient video indexing in many video-browsing website applications, such as sports video highlights and dynamic video covers. In these applications, the most important goal is to generate video summaries that capture the content users find interesting. While many existing methods generate video summaries from low-level features, this paper first proposes to mine large-scale Flickr images and find "interest" and "non-interest" images from Flickr for the same query, in order to learn what is of interest to users. Unlike existing pairwise ranking-based methods for video summarization, we then propose an improved triplet deep ranking model that converges more easily; it learns the relationship between "interest" and "non-interest" Flickr images and identifies which visual content of the original video users actually prefer. In the training process, triplets (interest image p+, interest image p'+, non-interest image p-) are selected as input to train a model with three parallel deep convolutional networks. In the video summarization process, an efficient entropy-based video segmentation method divides the original video into segments, and the visual interest score of each segment is estimated using the trained ranking network for summarization (SumNet). Then, an optimal subset of the segments is selected to create a summary capturing interesting visual content. We evaluate our method against several state-of-the-art methods; experimental results show that it improves on the best baseline by 9.6% in mean Average Precision (mAP).
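The two steps the abstract describes, ranking with triplets and selecting an optimal subset of segments, can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract gives neither the loss formula nor the selection criterion, so the hinge-style loss, the `margin` parameter, the 0/1 knapsack formulation, the length `budget`, and both function names are assumptions made for illustration only.

```python
def triplet_ranking_loss(s_pos, s_pos2, s_neg, margin=1.0):
    """Assumed hinge loss over scores for a triplet (p+, p'+, p-).

    It is zero only when both interest images outscore the
    non-interest image by at least `margin`.
    """
    return (max(0.0, margin - (s_pos - s_neg))
            + max(0.0, margin - (s_pos2 - s_neg)))


def select_segments(scores, lengths, budget):
    """Assumed 0/1 knapsack: maximize total interest score subject to
    total segment length <= budget (integer lengths, e.g. in seconds)."""
    n = len(scores)
    # best[i][b] = best total score using the first i segments within length b
    best = [[0.0] * (budget + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for b in range(budget + 1):
            best[i][b] = best[i - 1][b]
            if lengths[i - 1] <= b:
                cand = best[i - 1][b - lengths[i - 1]] + scores[i - 1]
                if cand > best[i][b]:
                    best[i][b] = cand
    # Backtrack to recover which segments were chosen.
    chosen, b = [], budget
    for i in range(n, 0, -1):
        if best[i][b] != best[i - 1][b]:
            chosen.append(i - 1)
            b -= lengths[i - 1]
    return sorted(chosen)


# Hypothetical interest scores and lengths for four segments.
summary = select_segments([0.9, 0.2, 0.8, 0.4], [4, 2, 3, 2], budget=7)
```

In this sketch the scores passed to `select_segments` stand in for the per-segment visual interest scores that a trained ranking network would produce; the values used here are made up.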

Be Relevant, Non-Redundant, and Timely: Deep Reinforcement Learning for Real-Time Event Summarization.

An Effective Hybrid Learning Model for Real-Time Event Summarization.

Learning User Interest with Improved Triplet Deep Ranking and Web-Image Priors for Topic-Related Video Summarization.

Automatic Document Summarization Via Deep Neural Networks

MARES: Multitask Learning Algorithm for Web-scale Real-Time Event Summarization

Real-Time Summarization of Twitter

Video Summarization through Reinforcement Learning with a 3D Spatio-Temporal U-Net

Crowd Aware Summarization of Surveillance Videos by Deep Reinforcement Learning

Abstractive text summarization model combining a hierarchical attention mechanism and multiobjective reinforcement learning

Deep Reinforced Self-Attention Masks for Abstractive Summarization (DR.SAS)

AttSum: Joint Learning of Focusing and Summarization with Neural Attention.

DQNC2S: DQN-based Cross-stream Crisis event Summarizer

Action Parsing-Driven Video Summarization Based on Reinforcement Learning

Progressive Reinforcement Learning for Video Summarization

A Novel Relational Learning-To-Rank Approach For Topic-Focused Multi-Document Summarization

Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning

Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization

Deep Semantic and Attentive Network for Unsupervised Video Summarization

Query-oriented unsupervised multi-document summarization via deep learning model

Video Summarisation by Classification with Deep Reinforcement Learning