Dimension Reduction Based on Sampling

Zhuping Li, Di Yang, Mengmeng Li, Haifeng Guo, T. Ye, Hongzhi Wang
DOI: https://doi.org/10.1007/978-981-99-5968-6_15
2023-01-01
Abstract: Dimension reduction provides a powerful means of reducing the number of random variables under consideration. However, large datasets often contain many similar tuples. Removing some of these similar tuples before reducing the dimension of the dataset retains its main information while accelerating the dimension reduction. Accordingly, we propose a dimension reduction technique based on biased sampling, a new procedure that incorporates features of both dimension reduction and biased sampling to obtain a computationally efficient means of reducing the number of random variables under consideration. In this paper, we choose Principal Components Analysis (PCA) as the main dimension reduction algorithm to study, and we show how this approach works.
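The abstract does not specify the sampling procedure, so the following is only a minimal sketch of the general idea: thin out near-duplicate tuples first, then run PCA on the smaller sample. The grid-based thinning in `drop_similar_tuples`, the `cell_size` parameter, and the synthetic data are all assumptions made for illustration and stand in for the paper's biased sampling; they are not the authors' method. PCA is computed here with a plain SVD in NumPy.

```python
import numpy as np

def drop_similar_tuples(X, cell_size=0.5):
    """Keep one representative row per grid cell.

    Hypothetical stand-in for biased sampling: rows that fall into the
    same cell of a regular grid are treated as "similar".
    """
    cells = np.floor(X / cell_size).astype(np.int64)
    # Index of the first row seen in each distinct cell.
    _, first_idx = np.unique(cells, axis=0, return_index=True)
    return X[np.sort(first_idx)]

def pca(X, n_components):
    """Project X onto its top principal components using SVD."""
    centered = X - X.mean(axis=0)
    # Rows of vt are the principal directions, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    return centered @ components.T, components

# Synthetic data (assumption): 200 base tuples, each repeated 5 times
# with tiny jitter so the dataset contains many similar tuples.
rng = np.random.default_rng(0)
base = rng.normal(size=(200, 5))
X = np.repeat(base, 5, axis=0) + rng.normal(scale=1e-4, size=(1000, 5))

sample = drop_similar_tuples(X)          # fewer rows, same columns
reduced, components = pca(sample, n_components=2)
```

Because the SVD cost grows with the number of rows, running PCA on `sample` instead of `X` is where the speed-up in this sketch would come from; how much information is retained depends on the sampling rule, which the paper itself defines.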