Abstract:Different kinds of high-dimensional visual features can be extracted from a single image. Images can thus be treated as multiple view data when taking each type of extracted high-dimensional visual feature as a particular understanding of images. In this paper, we propose a framework of sparse unsupervised dimensionality reduction for multiple view data. The goal of our framework is to find a low-dimensional optimal consensus representation from multiple heterogeneous features by multiview learning. In this framework, we first learn low-dimensional patterns individually from each view, considering the specific statistical property of each view. We construct a low-dimensional optimal consensus representation from those learned patterns, the goal of which is to leverage the complementary nature of the multiple views. We formulate the construction of the low-dimensional consensus representation to approximate the matrix of patterns by means of a low-dimensional consensus base matrix and a loading matrix. To select the most discriminative features for the spectral embedding of multiple views, we propose to add an l(1)-norm into the loading matrix's columns and impose orthogonal constraints on the base matrix. We develop a new alternating algorithm, i.e., spectral sparse multiview embedding, to efficiently obtain the solution. Each row of the loading matrix encodes structured information corresponding to multiple patterns. In order to gain flexibility in sharing information across subsets of the views, we impose a novel structured sparsity-inducing norm penalty on the loading matrix's rows. This penalty makes the loading coefficients adaptively load shared information across subsets of the learned patterns. We call this method structured sparse multiview dimensionality reduction. Experiments on a toy benchmark image data set and two real-world Web image data sets demonstrate the effectiveness of the proposed algorithms.

Robust Multiview Feature Learning for RGB-D Image Understanding

Multi-View Correlated Feature Learning by Uncovering Shared Component.

A Multiview Learning Framework with a Linear Computational Cost

Recurrent Volume-Based 3-D Feature Fusion for Real-Time Multiview Object Pose Estimation.

Recurrent Volume-based 3D Feature Fusion for Real-time Multi-view Object Pose Estimation

Multi-view Robust Graph Representation Learning for Graph Classification

Multi-View Graph Embedding Learning for Image Co-Segmentation and Co-Localization

Learning Multiviewpoint Context-Aware Representation for RGB-D Scene Classification

MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition.

High-Order Distance-Based Multiview Stochastic Learning in Image Classification

Robust Dual-Graph Regularized Deep Matrix Factorization for Multi-view Clustering

A Unified Framework Based on Graph Consensus Term for Multiview Learning

Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition

Multimodal deep learning for robust RGB-D object recognition

A Multi-View Fusion Method Via Tensor Learning And Gradient Descent For Image Features

Sparse Unsupervised Dimensionality Reduction for Multiple View Data.

RGB×D: Learning Depth-Weighted RGB Patches for RGB-D Indoor Semantic Segmentation

Pairwise constraints based multiview features fusion for scene classification

Visual Understanding via Multi-Feature Shared Learning with Global Consistency

View-based 3D Object Retrieval Via Multi-Modal Graph Learning

Multi-View Matrix Decomposition: A New Scheme for Exploring Discriminative Information.