L-2,L-0 Constrained Sparse Dictionary Selection For Video Summarization

Shaohui Mei,Genliang Guan,Zhiyong Wang,Mingyi He,Xian-Sheng Hua,David Dagan Feng
DOI: https://doi.org/10.1109/ICME.2014.6890179
2014-01-01
Abstract:The ever increasing volume of video content has created profound challenges for developing efficient video summarization (VS) techniques to access the data. Recent developments on sparse dictionary selection have demonstrated promising results for VS, however, the convex relaxation based solution cannot ensure the sparsity of the dictionary directly and it selects keyframes in a local point of view. In this paper, an L-2,L-0 constrained sparse dictionary selection model is proposed to reformulate the problem of VS. In addition, a simultaneous orthogonal matching pursuit (SOMP) based method is proposed to obtain an approximate solution for the proposed model without smoothing the penalty function, and thus selects keyframes in a global point of view. In order to allow for intuitive and flexible configuration of VS process, a percentage of residuals (POR) criterion is also developed to produce video summaries in different lengths. Experimental results demonstrate that our proposed method outperforms the state-of-the-art.
What problem does this paper attempt to address?