Survey on Video Temporal Segmentation

朱曦,林行刚
DOI: https://doi.org/10.3321/j.issn:0254-4164.2004.08.004
2004-01-01
Jisuanji Xuebao/Chinese Journal of Computers
Abstract:Video temporal segmentation denotes partitioning a video sequence into shots which is the first step toward video content analysis and content-based video browsing and retrieval. A video shot is defined as a series of inter-related consecutive frames taken contiguously by a single camera. Video shots are also considered to be the primitives for higher level content analysis, indexing, and classification. Starting from a brief description of the structure of video and the classification of video shots, the paper generalize the common methods to extract features and build metrics for computing the discontinuity values. The characteristics of the features are presented in details and compared with each other. Afterwards the paper reviews the algorithms for detecting the cut and gradual shot changes and their advantages and drawbacks. The approaches to determine the adaptive threshold and the statistical methods are described respectively and a newly proposed inertia-based cut detection algorithm is also introduced here. Then the problems of finding shot boundaries in compressed domain are analyzed which include how to make use of the information of compressed video and how to eliminate the influences of sensitive factors.
What problem does this paper attempt to address?