Abstract:We present the MDS feature learning framework, in which multidimensional scaling (MDS) is applied on high-level pairwise image distances to learn fixed-length vector representations of images. The aspects of the images that are captured by the learned features, which we call MDS features, completely depend on what kind of image distance measurement is employed. With properly selected semantics-sensitive image distances, the MDS features provide rich semantic information about the images that is not captured by other feature extraction techniques. In our work, we introduce the iterated Levenberg-Marquardt algorithm for solving MDS, and study the MDS feature learning with IMage Euclidean Distance (IMED) and Spatial Pyramid Matching (SPM) distance. We present experiments on both synthetic data and real images --- the publicly accessible UIUC car image dataset. The MDS features based on SPM distance achieve exceptional performance for the car recognition task.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: how to learn a fixed - length image feature representation based on the distance metric between images through Multidimensional Scaling (MDS) technology and apply it to object recognition tasks. Specifically, the author proposes a new feature - learning framework, which uses MDS to operate on the distances between high - level image pairs to learn the fixed - length vector representation of images. These learned features are called MDS features, and the aspects of the images they capture depend entirely on the image distance metric method used. ### Core problems and methods of the paper 1. **Problem background**: - Traditional image feature extraction methods are divided into two categories: hand - designed features and feature - learning methods. Hand - designed features mainly focus on the color, texture, and gradient information of images, while feature - learning uses the pattern distribution of the entire data set to generate a fixed - length vector representation for each image. - An important challenge in feature - learning is how to effectively encode the semantic information of images into a low - dimensional feature space while maintaining computational efficiency. 2. **Proposed methods**: - **MDS feature - learning framework**: The author proposes a multidimensional scaling feature - learning framework, in which MDS is applied to the distances between high - level image pairs to learn a fixed - length vector representation. - **Iterative Levenberg - Marquardt Algorithm (ILMA)**: To solve the MDS optimization problem, the author introduces a two - stage iterative Levenberg - Marquardt algorithm (ILMA), which minimizes the raw stress in the initialization stage and the adjustment stage respectively, thus efficiently solving the MDS problem. - **Image distance metric**: The author studies two popular image distance metric methods - IMage Euclidean Distance (IMED) and Spatial Pyramid Matching (SPM) distance, and evaluates their performance in experiments. ### Experimental results - **Synthetic data experiment**: The running time and convergence of different MDS algorithms are evaluated on the Swiss roll data set, and the results show that ILMA is faster than other methods and converges to a smaller raw stress value. - **Real - image experiment**: The UIUC car image data set is used for the car - recognition task, and the performance of MDS features is compared with that of traditional dimensionality - reduction algorithms such as PCA and kernel PCA. The experimental results show that the MDS features based on the SPM distance achieve the best classification performance in all feature dimensions, especially in the low - dimensional feature space. ### Conclusion The paper demonstrates the effectiveness of the MDS feature - learning framework, especially when combined with semantically - sensitive image distance metrics (such as SPM distance), which can significantly improve the performance of object - recognition tasks. Future work can further explore more types of image distance metrics and verify the effectiveness of this method on larger - scale data sets.

Feature Learning by Multidimensional Scaling and its Applications in Object Recognition

MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition.

A Unified Feature Representation and Learning Framework for 3D Shape

Few-Shot Object Detection with Multi-level Information Interaction for Optical Remote Sensing Images

MDFL: Multi-domain Diffusion-driven Feature Learning

Ordinal Distance Metric Learning with MDS for Image Ranking

Robust dimensionality reduction via feature space to feature space distance metric learning.

High-Order Distance-Based Multiview Stochastic Learning in Image Classification

Learning high-level features by fusing multi-view representation of MLS point clouds for 3D object recognition in road environments

Multidimensional Scaling on Multiple Input Distance Matrices

Discriminative Multidimensional Scaling for Low-Resolution Face Recognition.

Semantic preserving distance metric learning and applications

Isometric Multi-manifold Learning for Feature Extraction

Multi-feature distance metric learning for non-rigid 3D shape retrieval

Multidomain Subspace Classification for Hyperspectral Images

A Discriminative Vectorial Framework for Multi-modal Feature Representation

Modality-specific and hierarchical feature learning for RGB-D hand-held object recognition

Image classification by multimodal subspace learning

Fortifying Centers and Edges: Multidomain Feature Learning Meets Hyperspectral Image Classification

Multi-dimensional Scaling from K-Nearest Neighbourhood Distances

3D Shape Recognition and Retrieval Based on Multi-Modality Deep Learning