Abstract:With the rapid development of technologies for fast Internet access and the popularization of digital cameras, an enormous number of digital images are posted and shared online everyday. Web images are usually organized by topic and are often assigned appropriate topic-related textual descriptions. Given a large set of images along with the corresponding texts, a challenging problem is how to utilize the available information to efficiently and effectively perform image retrieval tasks, such as image classification and image clustering. Previous approaches on image categorization focus on either adopting text or image features, or simply combining these two types of information together. In this paper, we improve our previously reported two multi-view classification approaches—( Dynamic Weighting and Region-based Semantic Concept Integration ) for categorizing the images under the “supervision” of topic-related textual descriptions—by proposing a novel multimedia information fusion framework , in which these two proposed methods are seamlessly integrated by analyzing the special characteristics of different images. Notice that, the proposed framework is a generic multimedia information fusion framework which is not limited to our previously reported two approaches, and it can also be used to integrate other existing multi-view classification methods or models. Also, our proposed framework is capable of handling the large scale image categorization. Specifically, the proposed framework can automatically choose an appropriate classification model for each testing image according to its special characteristics and consequently achieve better classification performance with relatively less computation time for large scale datasets; Moreover, it is able to categorize images without any textual description in real world applications. Empirical experiments on two different types of web image datasets demonstrate the efficacy and efficiency of our proposed classification framework.

A multimedia information fusion framework for web image categorization

A Unified Image Fusion Framework with Flexible Bilevel Paradigm Integration

A multimodal framework for unsupervised feature fusion

Web Multimedia Object Clustering via Information Fusion

Exploring Interaction Between Images and Texts for Web Image Categorization.

Where Elegance Meets Precision: Towards a Compact, Automatic, and Flexible Framework for Multi-modality Image Fusion and Applications

Multi-Sensor Image Fusion: A Survey of the State of the Art

Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion

A Multi-Modal Image Fusion Framework Based on Guided Filter and Sparse Representation

A Multi-scale Information Integration Framework for Infrared and Visible Image Fusion

A Short Video Classification Framework Based on Cross-Modal Fusion

Multimodal Fusion for Image and Text Classification with Feature Selection and Dimension Reduction

A General Image Fusion Framework Using Multi-Task Semi-Supervised Learning

A Multi-Weight Fusion Framework for Infrared and Visible Image Fusion

Multimodal information fusion for selected multimedia applications

An image fusion algorithm based on image clustering theory

A unified multimodal classification framework based on deep metric learning

Multi-scale infrared and visible image fusion framework based on dual partial differential equations

AFDFusion: an Adaptive Frequency Decoupling Fusion Network for Multi-Modality Image

Visual and textual fusion for semantically supervised region-based retrieval

Multi-Sensor Image Fusion Using Optimized Support Vector Machine and Multiscale Weighted Principal Component Analysis