Abstract:Recently, we have witnessed a surge of interests of learning a low-dimensional subspace for scene classification. The existing methods do not perform well since they do not consider scenes' multiple features from different views in low-dimensional subspace construction. In this paper, we describe scene images by finding a group of features and explore their complementary characteristics. We consider the problem of multiview dimensionality reduction by learning a unified low-dimensional subspace to effectively fuse these features. The new proposed method takes both intraclass and interclass geometries into consideration, as a result the discriminability is effectively preserved because it takes into account neighboring samples which have different labels. Due to the semantic gap, the fusion of multiview features still cannot achieve excellent performance of scene classification in real applications. Therefore, a user labeling procedure is introduced in our approach. Initially, a query image is provided by the user, and a group of images are retrieved by a search engine. After that, users label some images in the retrieved set as relevant or irrelevant with the query. The must-links are constructed between the relevant images, and the cannot-links are built between the irrelevant images. Finally, an alternating optimization procedure is adopted to integrate the complementary nature of different views with the user labeling information, and develop a novel multiview dimensionality reduction method for scene classification. Experiments are conducted on the real-world datasets of natural scenes and indoor scenes, and the results demonstrate that the proposed method has the best performance in scene classification. In addition, the proposed method can be applied to other classification problems. The experimental results of shape classification on Caltech 256 suggest the effectiveness of our method.

Subspace-based Multi-View Fusion for Instance-Level Image Retrieval

Exploiting Hierarchical Activations of Neural Network for Image Retrieval.

Effective Image Retrieval Via Multilinear Multi-index Fusion.

A Multi-View Fusion Method Via Tensor Learning And Gradient Descent For Image Features

Fusion of Infrared and Visible Images Via Multi-Layer Convolutional Sparse Representation

Subspace-based self-weighted multiview fusion for instance retrieval

Multi-Index Fusion Via Similarity Matrix Pooling for Image Retrieval

CCSR-Net: Unfolding Coupled Convolutional Sparse Representation for Multi-focus Image Fusion.

Multi-scale Fusion Transformer Based Weakly Supervised Hashing Learning for Instance Retrieval

Query Dependent Multiview Features Fusion for Effective Medical Image Retrieval

Adaptive multi-feature fusion via cross-entropy normalization for effective image retrieval

Multiple Level Visual Semantic Fusion Method for Image Re-Ranking.

Multi-FusNet: fusion mapping of features for fine-grained image retrieval networks

Retrieving Images by Multiple Samples via Fusing Deep Features.

Multi-View 3d Object Retrieval with Deep Embedding Network

Pairwise constraints based multiview features fusion for scene classification

Multi-Modal Image Fusion Via Sparse Representation and Multi-Scale Anisotropic Guided Measure

A Late Fusion Approach for Harnessing Multi-Cnn Model High-Level Features

SFPFusion: An Improved Vision Transformer Combining Super Feature Attention and Wavelet-Guided Pooling for Infrared and Visible Images Fusion

Multi-view scene matching with relation aware feature perception

Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation