Nonparametric feature selection

E. Patrick,F. P. Fischer
DOI: https://doi.org/10.1109/TIT.1969.1054354
IF: 2.5
1969-09-01
IEEE Transactions on Information Theory
Abstract:Two groups of L -dimensional observations of size N_{1} and N_{2} are known to be random vector variables from two unknown probability distribution functions [1]. A method is discussed for obtaining an l -dimensional linear subspace of the observation space in which the l -variate marginal distributions are most separated, based on a nonparametric estimate of probability density functions and a distance criterion. The distance used essentially is the L_{2} norm of the difference between Parzen estimates of the two densities. An algorithm is developed that determines the subspace for which the distance between the two densities is maximized. Computer simulations are performed.
What problem does this paper attempt to address?