Feature Learning by Multidimensional Scaling and its Applications in Object Recognition

Quan Wang,Kim L. Boyer
DOI: https://doi.org/10.48550/arXiv.1306.3294
2013-06-14
Abstract:We present the MDS feature learning framework, in which multidimensional scaling (MDS) is applied on high-level pairwise image distances to learn fixed-length vector representations of images. The aspects of the images that are captured by the learned features, which we call MDS features, completely depend on what kind of image distance measurement is employed. With properly selected semantics-sensitive image distances, the MDS features provide rich semantic information about the images that is not captured by other feature extraction techniques. In our work, we introduce the iterated Levenberg-Marquardt algorithm for solving MDS, and study the MDS feature learning with IMage Euclidean Distance (IMED) and Spatial Pyramid Matching (SPM) distance. We present experiments on both synthetic data and real images --- the publicly accessible UIUC car image dataset. The MDS features based on SPM distance achieve exceptional performance for the car recognition task.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: how to learn a fixed - length image feature representation based on the distance metric between images through Multidimensional Scaling (MDS) technology and apply it to object recognition tasks. Specifically, the author proposes a new feature - learning framework, which uses MDS to operate on the distances between high - level image pairs to learn the fixed - length vector representation of images. These learned features are called MDS features, and the aspects of the images they capture depend entirely on the image distance metric method used. ### Core problems and methods of the paper 1. **Problem background**: - Traditional image feature extraction methods are divided into two categories: hand - designed features and feature - learning methods. Hand - designed features mainly focus on the color, texture, and gradient information of images, while feature - learning uses the pattern distribution of the entire data set to generate a fixed - length vector representation for each image. - An important challenge in feature - learning is how to effectively encode the semantic information of images into a low - dimensional feature space while maintaining computational efficiency. 2. **Proposed methods**: - **MDS feature - learning framework**: The author proposes a multidimensional scaling feature - learning framework, in which MDS is applied to the distances between high - level image pairs to learn a fixed - length vector representation. - **Iterative Levenberg - Marquardt Algorithm (ILMA)**: To solve the MDS optimization problem, the author introduces a two - stage iterative Levenberg - Marquardt algorithm (ILMA), which minimizes the raw stress in the initialization stage and the adjustment stage respectively, thus efficiently solving the MDS problem. - **Image distance metric**: The author studies two popular image distance metric methods - IMage Euclidean Distance (IMED) and Spatial Pyramid Matching (SPM) distance, and evaluates their performance in experiments. ### Experimental results - **Synthetic data experiment**: The running time and convergence of different MDS algorithms are evaluated on the Swiss roll data set, and the results show that ILMA is faster than other methods and converges to a smaller raw stress value. - **Real - image experiment**: The UIUC car image data set is used for the car - recognition task, and the performance of MDS features is compared with that of traditional dimensionality - reduction algorithms such as PCA and kernel PCA. The experimental results show that the MDS features based on the SPM distance achieve the best classification performance in all feature dimensions, especially in the low - dimensional feature space. ### Conclusion The paper demonstrates the effectiveness of the MDS feature - learning framework, especially when combined with semantically - sensitive image distance metrics (such as SPM distance), which can significantly improve the performance of object - recognition tasks. Future work can further explore more types of image distance metrics and verify the effectiveness of this method on larger - scale data sets.