Probability Reconstruction Based On Graph Distance Neighbor Network For Visualizing Data

Jincong Lin, Jingqi Yan
DOI: https://doi.org/10.23919/ChiCC.2017.8029120
2017-01-01
Abstract: In the era of information explosion, there is an urgent need to process and analyze high-dimensional data sets that may contain thousands of dimensions. If high-dimensional data can be unfolded onto a two- or three-dimensional map, the main structure of the data can be grasped intuitively, which greatly benefits data exploration and pattern discovery. Visualization techniques that aim to reveal the low-dimensional manifold embedded in high-dimensional data address this challenging task. This paper introduces a new visualization algorithm called probability reconstruction based on graph distance neighbor network (PR-GDNN). The PR-GDNN algorithm uses graph distance to construct a neighbor network, then performs a probability reconstruction inferred from the neighborhood relationships, and finally minimizes the Kullback-Leibler divergence. With these operations, the PR-GDNN algorithm reveals the structure of the data better than classical embedding algorithms, especially in preserving local structure. Qualitative and quantitative comparisons on three standard data sets demonstrate the superiority of the proposed algorithm.
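The three stages named in the abstract (graph-distance neighbor network, probability reconstruction, Kullback-Leibler minimization) can be illustrated with a minimal sketch. The abstract gives no formulas, so every concrete choice here is an assumption borrowed from related methods, not the authors' PR-GDNN definition: shortest paths over a k-nearest-neighbor graph (as in Isomap), a Gaussian kernel with a fixed bandwidth `sigma` on those graph distances, a Student-t kernel in the embedding (as in t-SNE), and plain gradient descent with step halving. The function names, `k`, `sigma`, and the helix test data are likewise hypothetical.

```python
import numpy as np

def graph_distances(X, k=4):
    """Shortest-path distances over a symmetrized k-nearest-neighbor graph."""
    n = X.shape[0]
    d = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(-1))
    G = np.full((n, n), np.inf)
    np.fill_diagonal(G, 0.0)
    nn = np.argsort(d, axis=1)[:, 1:k + 1]   # column 0 is the point itself
    rows = np.repeat(np.arange(n), k)
    G[rows, nn.ravel()] = d[rows, nn.ravel()]
    G = np.minimum(G, G.T)                   # make the graph undirected
    for m in range(n):                       # Floyd-Warshall relaxation
        G = np.minimum(G, G[:, m:m + 1] + G[m:m + 1, :])
    return G

def neighbor_probabilities(G, sigma):
    """Assumed form: Gaussian kernel on graph distance, normalized jointly."""
    P = np.exp(-G ** 2 / (2.0 * sigma ** 2))
    np.fill_diagonal(P, 0.0)
    return P / P.sum()

def low_dim_similarities(Y):
    """Assumed form: Student-t kernel in the embedding, as in t-SNE."""
    diff = Y[:, None, :] - Y[None, :, :]
    W = 1.0 / (1.0 + (diff ** 2).sum(-1))
    np.fill_diagonal(W, 0.0)
    return diff, W, W / W.sum()

def kl_divergence(P, Q):
    mask = P > 0                             # skips the diagonal and underflowed pairs
    return float(np.sum(P[mask] * np.log(P[mask] / Q[mask])))

def embed(P, dim=2, iters=200, lr=50.0, seed=0):
    """Minimize KL(P || Q) by gradient descent, halving the step on failure."""
    rng = np.random.default_rng(seed)
    Y = 1e-2 * rng.standard_normal((P.shape[0], dim))
    diff, W, Q = low_dim_similarities(Y)
    cost = kl_divergence(P, Q)
    history = [cost]
    for _ in range(iters):
        grad = 4.0 * (((P - Q) * W)[:, :, None] * diff).sum(axis=1)
        Y_new = Y - lr * grad
        diff_new, W_new, Q_new = low_dim_similarities(Y_new)
        cost_new = kl_divergence(P, Q_new)
        if cost_new < cost:                  # accept only steps that lower the KL
            Y, diff, W, Q, cost = Y_new, diff_new, W_new, Q_new, cost_new
        else:
            lr *= 0.5
        history.append(cost)
    return Y, history

# Hypothetical test data: 40 points along a 3-D helix (a 1-D manifold).
t = np.linspace(0.0, 4.0 * np.pi, 40)
X = np.column_stack([np.cos(t), np.sin(t), 0.3 * t])

G = graph_distances(X, k=4)
P = neighbor_probabilities(G, sigma=0.5)
Y, history = embed(P)
```

Using graph distance instead of Euclidean distance means that points on adjacent turns of the helix are treated as far apart unless they are close along the curve, which is the kind of local structure the abstract says the method aims to keep. The step-halving rule is only a convenience to keep this sketch stable; the optimizer used in the paper is not stated in the abstract.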