HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval.

Jie Lin, Ling-Yu Duan, Shiqi Wang, Yan Bai, Yihang Lou, Vijay Chandrasekhar, Tiejun Huang, Alex ChiChung Kot, Wen Gao
DOI: https://doi.org/10.1109/TMM.2017.2713410
IF: 7.3
2017-01-01
IEEE Transactions on Multimedia
Abstract: With emerging demand for large-scale video analysis, MPEG initiated the compact descriptor for video analysis (CDVA) standardization in 2014. Beyond handcrafted descriptors adopted by the current MPEG-CDVA reference model, we study the problem of deep learned global descriptors for video matching, localization, and retrieval. First, inspired by a recent invariance theory, we propose a nested invar...