A Hybrid Approach To News Video Classification With Multi-Modal Features

P Wang,R Cai,Sq Yang
DOI: https://doi.org/10.1109/ICICS.2003.1292564
2003-01-01
Abstract:This paper presents a hybrid approach to the classification of news video story. Most of current works on news story classification utilize the multi-modal features in a uniform manner. However, the reliability of audio-visual confidence is much lower than that of text, which may evidently lower-down the performance of the classification. We proposed a decision strategy mainly depends on the evidence from text classifiers with extra assistance of audio-visual clues. In our approach, SVMs for text features and GMMs for audio-visual features are first built for each category and then used to compute text and audio-visual confidence vectors respectively. To make final decision, a text-biased decision strategy is proposed to combine these multi-modal confidence vectors. To validate the performance, text-based classification and SVM-based meta-classification methods are compared on large-scale news stories from TV programs, and our proposed hybrid approach achieves the best overall performance.
What problem does this paper attempt to address?