Research and Comparison of Data Dimensionality Reduction Algorithms

Qin Liu,Ran Chen,Hongming Zhu,Hongfei Fan
DOI: https://doi.org/10.1145/3135954.3135965
2017-01-01
Abstract:With the explosive growth of information data, there are more and more situations in the academic research and industrial fields where a large amount of high-dimension data to be dealt with. Consequently, this has induced tremendous difficulties for noise processing and information mining. The dimensionality reduction algorithms are playing an increasingly significant role in the process of dealing with high dimensional data. A large number of data dimensionality reduction methods have been proposed in recent years. These data reduction algorithms try to find the internal connection within data, reduce the scale of data and retain the original data information. These algorithms conduct data dimensionality reduction from varied perspectives including mapping, and data similarity, etc. As a result, there are discrepancies among the said algorithms in terms of applicable targets, computational complexity and retention of the original information. In this paper, the principle of the existing typical dimensionality reduction algorithms is expounded, and the time complexity, data information retention and the object of application are compared via experiments. The remaining problems of data reduction are summarized in the end.
What problem does this paper attempt to address?