A Model of Selecting the Parameters Based on the Variance of Distance Ratios for Manifold Learning Algorithms.

Lukui Shi,Qingxin Yang,Yong Xu,Pilian He
DOI: https://doi.org/10.1109/fskd.2009.471
2009-01-01
Abstract:ISOMAP, LLE, Laplacian Eigenmaps and LTSA are several representative manifold learning algorithms. In most of manifold learning methods, there are two free parameters: the neighborhood size and the intrinsic dimension of the high dimensional data set. In this paper, we analyze and compare the stress function, the residual variance and the dy-dx representation. On the basis of the dy-dx representation, a quantitative measure based on the variance of distance ratios is used to determine these two parameters, which overcomes faults of the stress function and the residual variance. Experiments show that the model can be utilized not only to choose an appropriate neighborhood size but also to estimate the intrinsic dimension of the high dimensional complex data for different manifold learning techniques.
What problem does this paper attempt to address?