Research on Bag of Audio Words Algorithm in the Violent Videos Classification

Xinghao Jiang,Tanfeng Sun,Rongjie Li,Bing Feng
DOI: https://doi.org/10.1109/cisp.2010.5648004
2010-01-01
Abstract:In this paper, a new method to identify the violent videos by the bag of audio words is introduced. The MPEG-7 audio descriptors are firstly extracted, including the low level features such as AudioSpectrumCentroid and AudioSpectrum-Spread, etc. The audio words are then built according to the MPEG-7 high level descriptor, the AudioSighnature, which is considered as the “fingerprint” of the audio stream. The support vector machine is used to classify the feature vectors into two classes, i.e. the violent and non-violent videos. The experiment results demonstrate that our method can achieve good recall accuracy.
What problem does this paper attempt to address?