A Static Video Summarization Approach Via Block-Based Self-Motivated Visual Attention Scoring Mechanism

Wen-lin Li,Tong Zhang,Xiao Liu
DOI: https://doi.org/10.1007/s13042-023-01814-9
2023-01-01
International Journal of Machine Learning and Cybernetics
Abstract:Since automatic visual semantic comprehension of video content is currently infeasible and unintelligent, key frames extracted from videos are inconsistent with human visual understanding. In this paper, a block-based self-motivated visual attention scoring mechanism named the BSVAS mechanism is proposed for extracting key frames. The approach described in this paper first reduces the dimensionality of the video by exploiting entropy as a static global characteristic measurement. Next, two block-based motion metrics are employed to express features from a spatiotemporal perspective, and a novel self-motivated strategy is applied to conduct feature fusion. Finally, a self-motivated scoring algorithm is performed to evaluate content attractiveness and frame importance to generate key frames. Experiments on gesture videos with various postures demonstrate that key frames extracted using the proposed method provide high-quality video summaries and cover the main content of the gesture videos as compared to several other excellent mechanisms in the literature.
What problem does this paper attempt to address?