Multimodal fusion for multimedia analysis: a survey

Pradeep K. Atrey, M. Anwar Hossain, Abdulmotaleb El Saddik, Mohan S. Kankanhalli
DOI: https://doi.org/10.1007/s00530-010-0182-0
IF: 3.9
2010-04-04
Multimedia Systems
Abstract: This survey aims at providing multimedia researchers with a state-of-the-art overview of fusion strategies, which are used for combining multiple modalities in order to accomplish various multimedia analysis tasks. The existing literature on multimodal fusion research is presented through several classifications based on the fusion methodology and the level of fusion (feature, decision, and hybrid). The fusion methods are described from the perspective of the basic concept, advantages, weaknesses, and their usage in various analysis tasks as reported in the literature. Moreover, several distinctive issues that influence a multimodal fusion process, such as the use of correlation and independence, confidence level, contextual information, synchronization between different modalities, and the optimal modality selection, are also highlighted. Finally, we present the open issues for further research in the area of multimodal fusion.
computer science, information systems, theory & methods