Content-Based Video Retrieval Using Audio and Visual Clues

WG Cheng, D Xu
DOI: https://doi.org/10.1109/tencon.2002.1181343
2002-01-01
Abstract: Content-based retrieval has become a suitable approach to managing video databases. The word "video" refers to both the image frames and the audio waveform contained in a video, yet only a few approaches use the audio information. When either audio or visual information alone is insufficient, combining audio and visual clues may resolve the ambiguities of the individual modalities and thereby yield more accurate answers. In this paper, we present a novel idea of integrating audio features with visual features for video segmentation and retrieval.