An improved nonlinear correlation method for feature selection of complex data

Du Shang,Ang Li,Pengjian Shang
DOI: https://doi.org/10.1007/s11071-023-08406-w
IF: 5.741
2023-01-01
Nonlinear Dynamics
Abstract:In this paper, the affine-invariant Gini distance correlation/covariance is put forward to characterize the dependence between categorical and numerical variables more effectively. Furtherly, the useful features from analyzing subjects are extracted for the establishment of a reliable discrimination model, and proper measurement of correlation is built. The affine-invariant Gini distance measures are not only orthogonal invariant, but also affine invariant, which is important when considering the preservation of the equivalent statistical inference. More importantly, the Gini distance measurements capture nonlinear dependence as well as independence, which is more superb than the existing methods. The affine-invariant Gini distance estimators are easy to calculate, and the estimation of probability density of variables is not required. Besides, it can be easily extended to the reproducing kernel Hilbert spaces. The simulation- and reality-based experiments illustrate that the improved nonlinear correlation method is more effective in measuring dependencies and selecting useful features when compared to other methods.
What problem does this paper attempt to address?