Real-Time Recognition of Explosion Scenes Based on Audio-Visual Hierarchical Model

庄越挺,傅正钢,叶朝阳,吴飞
DOI: https://doi.org/10.3321/j.issn:1003-9775.2004.01.016
2004-01-01
Abstract: An audio-visual hierarchical model is used to detect explosion scenes in MPEG streams using compressed-domain features. First, a coarse SVM separates explosion and explosion-like audio from all other audio; several fine-grained SVMs then distinguish true explosion audio from explosion-like audio. This coarse-to-fine cascade yields the audio explosion candidates. Because most explosion scenes show an obvious visual change, the corresponding video is then checked to obtain the final result.
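The coarse-to-fine cascade described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the features, kernels, or the visual test, so everything named here is an assumption made for illustration. In particular, `LinearSVM` stands in for an already-trained SVM with a linear decision function, the two-element feature vectors and the scalar `visual_change` score are hypothetical, and the rule that a candidate must be accepted by every fine-grained SVM is one plausible way to combine them.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class LinearSVM:
    """Stand-in for a trained SVM with a linear decision function w.x + b.

    The weights and bias are hypothetical; the paper's actual models,
    kernels, and features are not given in the abstract.
    """
    weights: Sequence[float]
    bias: float

    def accepts(self, features: Sequence[float]) -> bool:
        score = sum(w * x for w, x in zip(self.weights, features)) + self.bias
        return score > 0.0


# One segment = (audio feature vector, visual change score); both hypothetical.
Segment = Tuple[Sequence[float], float]


def detect_explosions(
    segments: Sequence[Segment],
    coarse: LinearSVM,
    fine_models: Sequence[LinearSVM],
    visual_threshold: float,
) -> List[int]:
    """Return the indices of segments that pass all three stages."""
    hits = []
    for index, (audio_features, visual_change) in enumerate(segments):
        # Stage 1: the coarse SVM keeps explosion and explosion-like audio.
        if not coarse.accepts(audio_features):
            continue
        # Stage 2 (assumed rule): every fine-grained SVM must accept the
        # segment, each one rejecting a different explosion-like class.
        if not all(model.accepts(audio_features) for model in fine_models):
            continue
        # Stage 3: audio candidates are confirmed by an obvious visual change.
        if visual_change < visual_threshold:
            continue
        hits.append(index)
    return hits


# Toy data: four segments with made-up feature values.
coarse = LinearSVM(weights=[1.0, 0.0], bias=-0.5)
fine_models = [
    LinearSVM(weights=[0.0, 1.0], bias=-0.3),
    LinearSVM(weights=[1.0, 1.0], bias=-1.2),
]
segments = [
    ([0.9, 0.8], 0.7),  # passes audio and visual stages
    ([0.2, 0.9], 0.9),  # rejected by the coarse SVM
    ([0.8, 0.1], 0.9),  # rejected by a fine-grained SVM
    ([0.9, 0.8], 0.1),  # audio candidate, but no visual change
]
hits = detect_explosions(segments, coarse, fine_models, visual_threshold=0.5)
print(hits)
```

Ordering the stages this way means the inexpensive coarse test discards most segments early, so the fine-grained SVMs and the video check run only on the remaining candidates, which is in keeping with the real-time goal stated in the title.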