Asymptotic normality of a modified estimator of Gini distance correlation

Sang, Yongli
DOI: https://doi.org/10.1007/s00362-024-01575-9
2024-06-07
Statistical Papers
Abstract: Recently, the Gini distance correlation (GDC) was proposed to measure dependence between numerical and categorical variables (Dang et al. 2021). This new dependence measure characterizes independence between the random variables: it is zero if and only if the categorical variable and the numerical variable are independent. Limiting distributions of the naive estimator of GDC were established in Dang et al. (2021). It has been shown that, under independence, the empirical GDC admits a degenerating limit, which is an infinite weighted sum of chi-squared distributions. In this paper, we propose a modified estimator of the GDC that is asymptotically normal under independence between the numerical and the categorical variables. We also extend this method to the generalized GDC (Zhang et al. 2019) in a reproducing kernel Hilbert space (RKHS). Both the modified GDC and the generalized GDC can be applied to test the K-sample problem. Simulation studies are conducted to examine the finite-sample performance of the new K-sample test based on the modified estimators.
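
For orientation, here is a minimal Python sketch of a simple plug-in estimate of the GDC. It is not the modified estimator proposed in the paper. It assumes the definition from Dang et al. (2021), GDC = (Δ − Σ_k p_k Δ_k) / Δ, where Δ is the mean pairwise Euclidean distance in the pooled sample, Δ_k is the same quantity within category k, and p_k is the proportion of category k. The function name `gini_distance_correlation` and the choice to average over distinct pairs are illustrative assumptions, not taken from the abstract.

```python
import numpy as np


def gini_distance_correlation(x, y):
    """Plug-in estimate of the Gini distance correlation (illustrative sketch).

    Assumed definition: (Delta - sum_k p_k * Delta_k) / Delta, with Delta the
    mean pairwise Euclidean distance in the pooled sample and Delta_k the same
    quantity inside category k.
    """
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]  # treat a 1-D input as n observations of dimension 1
    y = np.asarray(y)
    n = x.shape[0]

    def mean_pairwise_distance(a):
        m = a.shape[0]
        if m < 2:
            return 0.0
        diff = a[:, None, :] - a[None, :, :]
        dist = np.sqrt((diff ** 2).sum(axis=-1))
        # Average over distinct ordered pairs (the diagonal contributes zero).
        return dist.sum() / (m * (m - 1))

    delta = mean_pairwise_distance(x)
    within = 0.0
    for label in np.unique(y):
        group = x[y == label]
        # Weight each within-category distance by the category proportion.
        within += (group.shape[0] / n) * mean_pairwise_distance(group)
    return (delta - within) / delta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    labels = np.repeat([0, 1, 2], 50)
    values = rng.normal(loc=labels * 2.0, scale=1.0)  # category shifts the mean
    print(gini_distance_correlation(values, labels))
```

Under this assumed definition, the estimate is close to 1 when the categories are well separated and close to 0 when the numerical variable has the same distribution in every category. That second case is the K-sample null hypothesis the abstract refers to.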
statistics & probability