Scene Detection Based on Combined Audio and Visual Feature

王辰,吴玲达,老松杨
DOI: https://doi.org/10.3969/j.issn.1001-3695.2008.10.044
2008-01-01
Abstract:Video structure extraction is essential to content-based organization and retrieval of video.While many robust shot segmentation algorithms have been presented,it is still difficult to identify scene accurately.This paper presented a scheme for determining scene which clustered shots combining audio and visual features of video.In particular,this method first clustered shots with respect to each feature,and then determined scene integrating the different cluster.Results from experiments showed that this approach to be potent.
What problem does this paper attempt to address?