Unsupervised Video Highlight Extraction via Query-related Deep Transfer

Han Wang,Huangyue Yu,Pei-xuan Chen,Rui Hua,Chuyi Yan,Ling Zou
DOI: https://doi.org/10.1109/ICPR.2018.8545808
2018-08-01
Abstract:The emergence of user-operated media motivates the explosive growth of online videos. Browsing these large amounts of videos is time-consuming and tedious, which makes finding the moments of user major or special preference (i.e. highlights extraction) becomes an urgent problem. Moreover, the user subjectivity over a video makes no fixed extraction meets all user preferences. This paper addresses these problems by posing a query-related highlight extraction framework which optimizes selected frames to both semantically query-related and visually representative of the entire video. Under this framework, relevance between the query text and the video frames is first computed on a visual-semantic feature embedding space induced by a convolutional neural network (Query-Inception network). Then we enforce the diversity on the video frames with the determinantal point process (DPP), a recently introduced probabilistic model for diverse subset selection. The experimental results show that our query-related highlight extraction method is particularly useful for news videos content fetching, e.g. showing the abstraction of the entire video while playing focus on the parts that matches the user queries.
Computer Science
What problem does this paper attempt to address?