Semantic Analysis for Soccer Video Based on Fusion of Multimodal Features

ZHANG Yu-zhen, WEI Dai-di, WANG Jian-yu, DAI Yue-wei
DOI: https://doi.org/10.3969/j.issn.1002-137X.2010.07.067
2010-01-01
Computer Science
Abstract: This paper proposes a framework that fuses multimodal features to detect soccer highlights. First, the audio stream is extracted from the video and classified using a CHMM. Then, using the temporal correspondence between audio and video, shooting events are detected in the video shots near those that contain excited commentator speech and audience cheering, based on the combination of goal and replay, where replays are detected from their logos. For each shooting event, whether a goal was scored is judged from the duration of the excited speech and cheering, the duration of the replay, and the appearance of a caption. In the video shots close to those that contain whistles, fouls are detected based on whether a replay appears and how long it lasts. Experiments show that the proposed system is highly efficient.
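The fusion rules described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class names, the audio label set ("excited", "whistle"), the time window, every duration threshold, and the choice to require all three scoring cues at once are assumptions made for illustration, since the abstract does not state them. The sketch also assumes that audio classification, goal detection, logo-based replay detection, and caption detection have already produced the inputs.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class AudioSegment:
    label: str    # hypothetical label set: "excited", "whistle", "other"
    start: float  # seconds
    end: float


@dataclass
class Shot:
    start: float  # seconds
    end: float
    has_goal: bool = False     # goal visible in the shot
    is_replay: bool = False    # shot lies between replay logos
    has_caption: bool = False  # caption (e.g. score overlay) appears


def nearby(shots: List[Shot], seg: AudioSegment, window: float) -> List[Shot]:
    """Return the shots that overlap the audio segment widened by `window` seconds."""
    return [s for s in shots
            if s.start <= seg.end + window and s.end >= seg.start - window]


def detect_events(audio: List[AudioSegment], shots: List[Shot],
                  window: float = 10.0,           # assumed search window
                  min_excited: float = 8.0,       # assumed threshold
                  min_score_replay: float = 6.0,  # assumed threshold
                  min_foul_replay: float = 3.0    # assumed threshold
                  ) -> List[Tuple[str, float]]:
    """Apply the rule-based fusion and return (event, audio start time) pairs."""
    events = []
    for seg in audio:
        near = nearby(shots, seg, window)
        replay_len = sum(s.end - s.start for s in near if s.is_replay)
        if seg.label == "excited":
            # Shooting event: goal and replay both present near excited audio.
            if any(s.has_goal for s in near) and replay_len > 0:
                # Scoring: long excitement, long replay, and a caption
                # (combining the cues with AND is an assumption).
                scored = (seg.end - seg.start >= min_excited
                          and replay_len >= min_score_replay
                          and any(s.has_caption for s in near))
                events.append(("goal" if scored else "shoot", seg.start))
        elif seg.label == "whistle":
            # Foul: a sufficiently long replay appears near a whistle.
            if replay_len >= min_foul_replay:
                events.append(("foul", seg.start))
    return events
```

Calling detect_events with a list of classified audio segments and a list of annotated shots returns the detected events in audio order. In practice the window and thresholds would have to be tuned on labeled matches.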