Multi-modal interview concept detection for rushes exploitation

Anan Liu,Sheng Tang,Yongdong Zhang,Jintao Li,Zhaoxuan Yang
DOI: https://doi.org/10.5555/1931390.1931407
2007-01-01
Abstract:According to the concepts of Large-Scale Concept Ontology for Multimedia (LSCOM) and requirement of the 4th task in the 2006 TRECVID, i.e., rushes exploitation, the "interview" concept is an important semantic concept for rushes content analysis. The paper presents the shot-level "interview" concept detection method. Face detection and audio classification are implemented to detect "face" and "speech" concepts for each shot. By integrating audiovisual information, "interview" concept is finally detected. The utilization of the method will definitely benefit the video edit. Large-scale experimental results strongly demonstrate the accuracy and effectiveness of the proposed method.
What problem does this paper attempt to address?