Content-Based Video Relevance Prediction with Second-Order Relevance and Attention Modeling

Xusong Chen, Rui Zhao, Shengjie Ma, Dong Liu, Zheng-Jun Zha
DOI: https://doi.org/10.1145/3240508.3266434
2018-01-01
Abstract: This paper describes our proposed method for the Content-Based Video Relevance Prediction (CBVRP) challenge. Our method is based on deep learning: we train a deep network to predict the relevance between two video sequences from their features. We explore the use of second-order relevance, both in preparing training data and in extending the deep network. Second-order relevance refers to, for example, the relevance between x and z when x is relevant to y and y is relevant to z. When preparing training data, we use second-order relevance to increase the number of positive samples and decrease the number of negative samples. We further extend the deep network with an attention module, in which the attention mechanism is designed for second-order relevant video sequences. We verify the effectiveness of our method on the validation set of the CBVRP challenge.
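One possible reading of the data-preparation step is sketched below. It is an illustration only, not the authors' implementation: the function names `second_order_pairs` and `build_training_pairs` are hypothetical, relevance is assumed to be symmetric, and second-order pairs are simply treated as extra positives and excluded from the negative pool. The abstract does not say how such pairs are weighted or sampled.

```python
from collections import defaultdict
from itertools import combinations


def second_order_pairs(first_order):
    """Return pairs {x, z} that share a relevant neighbour y but are not directly relevant.

    Assumption: relevance is symmetric, so each pair is stored as a frozenset.
    """
    neighbors = defaultdict(set)
    for a, b in first_order:
        if a != b:  # ignore self-pairs
            neighbors[a].add(b)
            neighbors[b].add(a)
    direct = {frozenset(p) for p in first_order}
    second = set()
    for linked in neighbors.values():
        # Any two distinct videos linked to the same y are second-order relevant.
        for x, z in combinations(linked, 2):
            pair = frozenset((x, z))
            if pair not in direct:
                second.add(pair)
    return second


def build_training_pairs(videos, first_order):
    """Split all video pairs into positives and negatives (hypothetical scheme).

    Positives: first-order pairs plus second-order pairs.
    Negatives: every remaining pair, so second-order pairs never appear as negatives.
    """
    direct = {frozenset(p) for p in first_order if p[0] != p[1]}
    positives = direct | second_order_pairs(first_order)
    all_pairs = {frozenset(p) for p in combinations(videos, 2)}
    negatives = all_pairs - positives

    def as_sorted_tuples(pairs):
        return sorted(tuple(sorted(p)) for p in pairs)

    return as_sorted_tuples(positives), as_sorted_tuples(negatives)


videos = ["a", "b", "c", "d"]
first_order = [("a", "b"), ("b", "c")]
positives, negatives = build_training_pairs(videos, first_order)
```

In this toy example, `a` and `c` are both relevant to `b`, so the pair `(a, c)` moves from the negative pool to the positive set, which is the increase in positives and decrease in negatives that the abstract describes.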