Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes.

Qi Dai, Jian Tu, Ziqiang Shi, Yu-Gang Jiang, Xiangyang Xue
2013-01-01
Abstract: The Violent Scenes Detection Task of MediaEval provides a valuable platform for algorithm evaluation and performance comparison. The task is very challenging because violent scenes take many forms, which vary significantly in their visual and auditory cues. In this notebook paper, we describe our system used in MediaEval 2013, which focuses on motion-based features and part-level semantic attributes. One key component of the system is a set of trajectory-based motion features that proved effective in last year’s evaluation. We also adopt a newly developed part-level attribute feature, which consists of detection scores of object and scene parts. Our results indicate that the trajectory-based motion features still offer very competitive performance, and that the attribute feature is also helpful in several situations. In addition, temporally smoothing the detection scores can lead to a significant performance gain. We conclude that a successful violent scenes detection system should use truly multimodal features, ranging from motion-based to static visual descriptors, as well as audio and attribute features.
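The abstract does not say how the detection scores are smoothed over time. As a rough illustration only, the sketch below assumes a centered moving average over per-segment scores; the function name `smooth_scores`, the `window` parameter, and the choice of a moving average are all assumptions made for this example, not details taken from the paper.

```python
from typing import List


def smooth_scores(scores: List[float], window: int = 3) -> List[float]:
    """Centered moving average over per-segment detection scores.

    Hypothetical helper: the paper's actual smoothing scheme is not
    described in the abstract, so this is only one plausible choice.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    smoothed = []
    for i in range(len(scores)):
        # Clip the window at the start and end of the video.
        lo = max(0, i - half)
        hi = min(len(scores), i + half + 1)
        neighborhood = scores[lo:hi]
        smoothed.append(sum(neighborhood) / len(neighborhood))
    return smoothed


if __name__ == "__main__":
    # An isolated high score is pulled toward its neighbors.
    print(smooth_scores([0.1, 0.2, 0.9, 0.2, 0.1], window=3))
```

The intuition is that violence tends to span several consecutive segments, so averaging each score with its temporal neighbors can suppress isolated spikes and fill short gaps inside a longer violent scene.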