Multi-Modal Person Identification In A Smart Environment

Hazim Kemal Ekenel,Mika Fischer,Qin Jin,Rainer Stiefelhagen
DOI: https://doi.org/10.1109/CVPR.2007.383388
2007-01-01
Abstract:In this paper, we present a detailed analysis of multimodal fusion for person identification in a smart environment. The multi-modal system consists of a videobased face recognition system and a speaker identification system. We investigated different score normalization, modality weighting and modality combination schemes during the fusion of the individual modalities. We introduced two new modality weighting schemes, namely, the cumulative ratio of correct matches (CRCM) and distance-to-second-closest (DT2ND) measures. In addition, we also assessed the effects of the well-known score normalization and classifier combination methods on the identification performance. Experimental results obtained on the CLEAR 2007 evaluation corpus, which contains audio-visual recordings from different smart rooms, show that CRCM-based modality weighting improves the correct identification rates significantly.
What problem does this paper attempt to address?