Unsupervised dimensionality reduction based on fractal dimension and genetic algorithm

YAN Guang-hui, LI Zhan-huai
DOI: https://doi.org/10.3778/j.issn.1002-8331.2008.10.007
2008-01-01
Abstract: Dimensionality reduction is a powerful method for tackling the "curse of dimensionality". Feature subset selection based on genetic algorithms is superior to traditional feature selection methods for reducing the dimensionality of high-dimensional data sets. However, it cannot be used in unsupervised learning tasks such as clustering, where no class labels are available. FDR (Fractal Dimensionality Reduction) is a newer unsupervised feature selection method, but it is infeasible in practice on high-dimensional data sets because it scans the data set multiple times and has a high time cost. Accordingly, the authors propose the GABUFSS (Genetic Algorithm Based Unsupervised Feature Subset Selection) algorithm, which combines the genetic algorithm with the fractal dimensionality reduction technique to tackle unsupervised feature subset selection in high-dimensional data sets. Experimental results on synthetic and real-life data sets show that GABUFSS performs better than FDR on high-dimensional data sets and that it can also find identical feature subsets.
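The abstract does not give the details of GABUFSS, so the following Python sketch only illustrates the general idea of using a fractal dimension as the label-free fitness signal of a genetic algorithm. Everything in it is an assumption rather than the authors' method: the box-counting estimate of the correlation fractal dimension, the fitness function (stay close to the fractal dimension of the full data set while penalizing subset size), the GA operators (tournament selection, one-point crossover, bit-flip mutation, elitism), and all names and parameter values (`correlation_dimension`, `fitness`, `select_features`, `penalty`, `p_mut`, and so on) are hypothetical.

```python
import math
import random
from collections import Counter


def correlation_dimension(points, radii=(0.5, 0.25, 0.125, 0.0625)):
    """Box-counting estimate of the correlation fractal dimension.

    For each cell size r, S(r) is the sum of squared cell occupancies;
    the estimate is the least-squares slope of log S(r) against log r.
    Assumes every coordinate has been scaled into [0, 1).
    """
    xs, ys = [], []
    for r in radii:
        cells = Counter(tuple(int(v / r) for v in p) for p in points)
        xs.append(math.log(r))
        ys.append(math.log(sum(c * c for c in cells.values())))
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


def fitness(mask, data, full_dim, penalty=0.05):
    """Hypothetical fitness: keep the fractal dimension, use few features."""
    if not any(mask):
        return float("-inf")  # an empty subset is never acceptable
    projected = [[v for v, keep in zip(row, mask) if keep] for row in data]
    gap = abs(full_dim - correlation_dimension(projected))
    return -gap - penalty * sum(mask)


def select_features(data, pop_size=20, generations=30, p_mut=0.1, seed=0):
    """Evolve a 0/1 feature mask; needs at least two features."""
    rng = random.Random(seed)
    n = len(data[0])
    full_dim = correlation_dimension(data)
    cache = {}

    def score(mask):
        # Cache fitness values so each distinct mask scans the data once.
        if mask not in cache:
            cache[mask] = fitness(mask, data, full_dim)
        return cache[mask]

    pop = [tuple(rng.randint(0, 1) for _ in range(n)) for _ in range(pop_size)]
    best = max(pop, key=score)
    for _ in range(generations):
        nxt = [best]  # elitism: carry the best mask forward unchanged
        while len(nxt) < pop_size:
            a = max(rng.sample(pop, 2), key=score)  # tournament selection
            b = max(rng.sample(pop, 2), key=score)
            cut = rng.randrange(1, n)               # one-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation: each bit flips with probability p_mut.
            nxt.append(tuple(bit ^ (rng.random() < p_mut) for bit in child))
        pop = nxt
        best = max(pop, key=score)
    return best


if __name__ == "__main__":
    rng = random.Random(1)
    # Toy data: feature 1 duplicates feature 0, feature 2 is independent.
    data = [(x, x, y) for x, y in
            ((rng.random(), rng.random()) for _ in range(500))]
    print(select_features(data))
```

In this sketch a redundant feature adds nothing to the estimated fractal dimension, so dropping it costs no fitness and saves the size penalty. Caching the fitness per mask is one simple way to limit repeated scans of the data set, which is the cost the abstract attributes to FDR.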