Robust Inference for High-dimensional Linear Models with Heavy-tailed Errors via Partial Gini Covariance

Yilin Zhang, Songshan Yang, Yunan Wu, Lan Wang
2024-11-19
Abstract: This paper introduces the partial Gini covariance, a novel dependence measure that addresses the challenges of high-dimensional inference with heavy-tailed errors, often encountered in fields like finance, insurance, climate, and biology. Conventional high-dimensional regression inference methods suffer from inaccurate type I errors and reduced power in heavy-tailed contexts, limiting their effectiveness. Our proposed approach leverages the partial Gini covariance to construct a robust statistical inference framework that requires minimal tuning and does not impose restrictive moment conditions on error distributions. Unlike traditional methods, it circumvents the need for estimating the density of random errors and enhances the computational feasibility and robustness. Extensive simulations demonstrate the proposed method's superior power and robustness over standard high-dimensional inference approaches, such as those based on the debiased Lasso. The asymptotic relative efficiency analysis provides additional theoretical insight on the improved efficiency of the new approach in the heavy-tailed setting. Additionally, the partial Gini covariance extends to the multivariate setting, enabling chi-square testing for a group of coefficients. We illustrate the method's practical application with a real-world data example.
Methodology, Statistics Theory
What problem does this paper attempt to address?
This paper addresses how to conduct robust statistical inference in high-dimensional linear models when the error distribution is heavy-tailed. Traditional high-dimensional regression inference methods show inaccurate type I error rates and reduced power under heavy-tailed errors, which limits their usefulness in fields such as finance, insurance, climate, and biology. Heavy-tailed errors lead to biased estimates, imprecise inference, and unreliable risk assessment. To meet this challenge, the authors propose the partial Gini covariance, a new dependence measure, and use it to build a robust inference framework with the following characteristics:
1. **Handling heavy-tailed random errors**: The partial Gini covariance accommodates a range of heavy-tailed error distributions, including the Cauchy distribution, which has received relatively little attention in the existing literature.
2. **Minimal parameter tuning**: The method is simple to implement and requires minimal tuning, avoiding much of the complexity of selecting regularization parameters in high-dimensional regression.
3. **No estimation of the error density**: The method performs valid inference without estimating the density of the random errors, which improves computational feasibility and robustness.
In extensive simulations, the authors show that the method outperforms standard high-dimensional inference methods such as the debiased Lasso in both power and robustness. An asymptotic relative efficiency analysis gives theoretical support for the efficiency gain in the heavy-tailed setting.
In addition, the partial Gini covariance extends to the multivariate setting, which enables chi-square tests for a group of coefficients. In summary, the paper introduces the partial Gini covariance to provide a robust inference method for high-dimensional data with heavy-tailed errors, with the aim of improving the accuracy and reliability of inference.
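The summary above does not define the partial Gini covariance, so the following is only a minimal sketch of the classical Gini covariance, Cov(X, F_Y(Y)), which we assume the new measure builds on. It illustrates why a rank-based dependence measure needs no moment condition on the heavy-tailed variable: Y enters only through its ranks, which are bounded. The function name `gini_covariance`, the sample size, and the simulated model are hypothetical choices made for illustration and are not taken from the paper.

```python
import numpy as np

def gini_covariance(x, y):
    """Sample analogue of Cov(x, F_y(y)), with F_y replaced by scaled ranks of y.

    Illustrative helper (hypothetical name); this is the classical Gini
    covariance, not the paper's partial Gini covariance.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.shape[0]
    # Ranks 1..n via double argsort (assumes no ties in y), scaled into (0, 1].
    ranks = np.argsort(np.argsort(y)) + 1
    f_hat = ranks / n
    # y contributes only through bounded ranks, so no moments of y are needed.
    return float(np.mean((x - x.mean()) * (f_hat - f_hat.mean())))

# Simulated example with Cauchy errors, whose mean and variance do not exist.
rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
eps = rng.standard_cauchy(size=n)

g_dep = gini_covariance(x, x + eps)  # response depends on x
g_ind = gini_covariance(x, eps)      # response independent of x
print(f"dependent: {g_dep:.3f}, independent: {g_ind:.3f}")
```

In this sketch the statistic stays well separated from zero when the response depends on `x` and stays close to zero when it does not, even though the errors are Cauchy. The paper's method additionally has to adjust for the remaining high-dimensional covariates, which this example does not attempt.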