Abstract:At present, dictionary based models have been widely used in image classification. The image features are approximated as a linear combination of bases selected from the dictionary in a sparse space, resulting in compact patterns. The features applied to image classification usually reside on low dimensional manifolds embedded in a high dimensional ambient space; traditional sparse coding algorithm, however, does not consider this topological structure. It can be characterized naturally by linear coefficients that reconstruct each data point from its neighbors. One of the central issues here is how to determine the neighbors and learn the coefficients. In this paper, the geometrical structures are encoded in two situations. In simple cases when data points distribute on a single manifold, it is explicitly modeled by locally linear embedding algorithm combined with k-nearest neighbors. Nevertheless, in real-world scenarios, complex data points often lie on multiple manifolds. Sparse representation algorithm combined with k-nearest neighbors is instead utilized to construct the topological structures, because it is capable of approximating the data point by selecting its homogenous neighbors adaptively to guarantee the smoothness of each manifold. After obtaining the local fitting relationship, these two topological structures are then embedded into sparse coding algorithm as regularization terms to formulate the corresponding objective functions of dictionary learning on single manifold (DLSM) and dictionary learning on multiple manifolds (DLMM), respectively. Upon this, a coordinate descent scheme is proposed to solve the unified optimization problems. Experimental results on several benchmark data sets, such as Caltech-256, Caltech-101, Scene 15, and UIUC-Sports, show that our proposed algorithms equal or outperform other state-of-the-art image classification algorithms.

Multi-Manifold Sparse Graph Embedding for Multi-Modal Image Classification

Learning Visually Aligned Semantic Graph for Cross-Modal Manifold Matching.

Multi-View Graph Embedding Learning for Image Co-Segmentation and Co-Localization

Manifold-based multi-graph embedding for semi-supervised classification

Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features

Geodesic Based Semi-supervised Multi-manifold Feature Extraction

Simultaneous Sparse Graph Embedding for Hyperspectral Image Classification

Multi-Manifold Deep Metric Learning For Image Set Classification

Neighbourhood Sensitive Preserving Embedding for Pattern Classification.

Semi-Supervised Graph Based Embedding with Non-Convex Sparse Coding Techniques

Unsupervised Multi-Class Co-Segmentation via Joint-Cut Over $L_{1}$ -Manifold Hyper-Graph of Discriminative Image Regions

Grassmannian regularized structured multi-view embedding for image classification.

Discriminative Sparse Coding on Multi-Manifold for Data Representation and Classification

Shared feature extraction for semi-supervised image classification.

Graph Embedding Learning for Cross-Modal Information Retrieval.

Learning Dictionary on Manifolds for Image Classification

MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

Learning transformer-based heterogeneously salient graph representation for multimodal remote sensing image classification

Deep Multi-Graph Hierarchical Enhanced Semantic Representation for Cross-Modal Retrieval

Multi-view image clustering based on sparse coding and manifold consensus

Person re-identification via semi-supervised adaptive graph embedding