CN-Isomap Algorithm for Nonlinear Dimensionality Reduction of Sparse Data

WU Sen, QUAN Xi-wei, CHEN Xue-chang
2010-01-01
Abstract: Isomap, a classic manifold learning algorithm, cannot effectively reduce the dimensionality of sparse nonlinear data. After analyzing the reasons for this, an improved algorithm, Cut-Neighbors Isometric feature mapping (CN-Isomap), is proposed. The algorithm first identifies the 'manifold neighbors' of each point when the data are sparse, so that the 'short circuit' edges can be deleted from the neighborhood graph. It then approximates the geodesic distance with a shortest-path algorithm, so that the estimated geodesic paths do not leave the manifold region. As a result, the low-dimensional embedding correctly reflects the inherent topological features of the sample points in the high-dimensional input space, which allows the algorithm to better recover the low-dimensional manifold embedded in the high-dimensional space and to reduce the dimensionality of sparse nonlinear data effectively. The effectiveness of the algorithm is verified by experiment on a benchmark data set. CN-Isomap is an extension of Isomap: it is effective for dimensionality reduction of sparse nonlinear data and is also applicable to non-sparse data.
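The two graph steps named in the abstract (deleting 'short circuit' edges, then estimating geodesic distances by shortest paths) can be illustrated with a minimal Python sketch. The abstract does not say how 'manifold neighbors' are identified, so the pruning rule in `cut_neighbors` is an assumption made only for illustration, not the authors' criterion: an edge is kept only if its length is at most `alpha` times the larger nearest-neighbor distance of its two endpoints. The function names, the parameter `alpha`, and the U-shaped toy data are likewise hypothetical, and the final embedding step of Isomap (classical multidimensional scaling) is omitted.

```python
import heapq
import math


def knn_graph(points, k):
    """Symmetric k-nearest-neighbor graph as {i: {j: euclidean_length}}."""
    graph = {i: {} for i in range(len(points))}
    for i, p in enumerate(points):
        dists = sorted((math.dist(p, q), j) for j, q in enumerate(points) if j != i)
        for d, j in dists[:k]:
            graph[i][j] = d
            graph[j][i] = d
    return graph


def cut_neighbors(graph, alpha=1.5):
    """Assumed pruning rule (NOT taken from the paper): keep an edge only if its
    length is at most alpha times the larger nearest-neighbor distance of its
    two endpoints."""
    scale = {i: min(nbrs.values()) for i, nbrs in graph.items()}
    return {
        i: {j: d for j, d in nbrs.items() if d <= alpha * max(scale[i], scale[j])}
        for i, nbrs in graph.items()
    }


def geodesic(graph, source):
    """Dijkstra shortest-path lengths from source, used as geodesic estimates."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, math.inf):
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist.get(v, math.inf):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


# Toy data: 11 sparse samples along a U-shaped curve whose two arms are only
# 1.8 apart, so a plain k-NN graph links the two tips directly.
left = [(0.0, float(y)) for y in range(4, -1, -1)]   # indices 0..4
right = [(1.8, float(y)) for y in range(0, 5)]       # indices 6..10
points = left + [(0.9, 0.0)] + right                 # index 5 is the bottom

raw = knn_graph(points, k=3)
pruned = cut_neighbors(raw)
d_raw = geodesic(raw, 0)[10]     # tip to tip through the short-circuit edge
d_cut = geodesic(pruned, 0)[10]  # tip to tip along the curve
print(d_raw, d_cut)
```

On this toy curve the unpruned graph reports a tip-to-tip distance equal to the direct gap of 1.8, while the pruned graph forces the path around the U and yields a much larger estimate. With real data the neighborhood size and the pruning criterion would have to follow the paper itself.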