Highlight sound effects detection in audio stream

Rui Cai,Lie Lu,Hong-Jiang Zhang,Lian-Hong Cai
DOI: https://doi.org/10.1109/ICME.2003.1221242
2003-01-01
Abstract:This paper addresses the problem of highlight sound effects detection in audio stream, which is very useful in fields of video summarization and highlight extraction. Unlike researches on audio segmentation and classification, in this domain, it just locates those highlight sound effects in audio stream. An extensible framework is proposed and in current system three sound effects are considered: laughter, applause and cheer, which are tied up with highlight events in entertainments, sports, meetings and home videos. HMMs are used to model these sound effects and a log-likelihood scores based method is used to make final decision. A sound effect attention model is also proposed to extend general audio attention model for highlight extraction and video summarization. Evaluations on a 2-hours audio database showed very encouraging results.
What problem does this paper attempt to address?