Abstract: Randomized experiments are the gold standard for investigating causal relationships, with comparisons of potential outcomes under different treatment groups used to estimate treatment effects. However, outcomes with heavy-tailed distributions pose significant challenges to traditional statistical approaches. While recent studies have explored these issues under simple randomization, their application in more complex randomization designs, such as stratified randomization or covariate-adaptive randomization, has not been adequately addressed. To fill the gap, this paper examines the properties of the estimated influence function-based M-estimator under covariate-adaptive randomization with heavy-tailed outcomes, demonstrating its consistency and asymptotic normality. Yet, the existing variance estimator tends to overestimate the asymptotic variance, especially under more balanced designs, and lacks universal applicability across randomization methods. To remedy this, we introduce a novel stratified transformed difference-in-means estimator to enhance efficiency and propose a universally applicable variance estimator to facilitate valid inferences. Additionally, we establish the consistency of kernel-based density estimation in the context of covariate-adaptive randomization. Numerical results demonstrate the effectiveness of the proposed methods in finite samples.
What problem does this paper attempt to address?
### Problems the paper attempts to solve
This paper addresses the challenge of estimating treatment effects when outcomes have heavy-tailed distributions and treatment is assigned by covariate-adaptive randomization. Specifically:
1. **Limitations of traditional methods**:
- Traditional statistical methods usually assume that the potential outcomes are well behaved, in particular that they have finite second moments. However, in economics, the social sciences, and clinical trials, many outcomes (such as payment amounts, customer spending power, or CD4 counts in HIV studies) are heavy-tailed. Such data violate this assumption, which makes traditional methods less effective and less reliable.
2. **Deficiencies in existing research**:
- Recent research on heavy-tailed outcomes has focused mainly on simple randomization and has overlooked more complex designs, such as stratified randomization or covariate-adaptive randomization. These complex designs are more common in practical applications, but they lack adequate theoretical support when outcomes are heavy-tailed.
3. **Main contributions of the paper**:
- This paper shows that the estimated influence function-based M-estimator is consistent and asymptotically normal under covariate-adaptive randomization. However, the existing variance estimator tends to overestimate the asymptotic variance, especially under more balanced designs, and does not apply to all randomization methods. To address this, the authors introduce a new stratified transformed difference-in-means estimator to improve efficiency and propose a universally applicable variance estimator to support valid inference.
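
To make the first point above concrete, the following minimal sketch simulates how an infinite second moment destabilizes a mean-based summary. It is an illustration only and is not taken from the paper: the Pareto distribution with shape 1.5 (finite mean, infinite variance), the sample size, the number of replications, and the helper name `spread_of_estimates` are all assumptions chosen for the example, and the sample median is used merely as a familiar robust comparison, not as the paper's estimator.

```python
import random
from statistics import fmean, median, stdev


def spread_of_estimates(alpha, n=500, reps=200, seed=0):
    """Return (sd of sample means, sd of sample medians) across replications.

    Hypothetical helper for illustration: draws `reps` samples of size `n`
    from a Pareto distribution with shape `alpha` (minimum value 1).
    """
    rng = random.Random(seed)
    means, medians = [], []
    for _ in range(reps):
        sample = [rng.paretovariate(alpha) for _ in range(n)]
        means.append(fmean(sample))
        medians.append(median(sample))
    return stdev(means), stdev(medians)


# Shape 1.5 gives a finite mean but an infinite variance (assumed setting).
sd_mean, sd_median = spread_of_estimates(alpha=1.5)
print(f"sd of sample mean across replications:   {sd_mean:.3f}")
print(f"sd of sample median across replications: {sd_median:.3f}")
```

In this setting the sample mean should fluctuate noticeably more across replications than the sample median, because a handful of extreme draws dominate it. This is the kind of instability that motivates robust, M-estimation-type approaches.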
### Specific problems and solutions
1. **Challenges of heavy-tailed distributions**:
- Heavy-tailed outcomes can cause traditional treatment-effect estimators to fail. For example, the usual difference-in-means or regression-adjusted estimators may no longer be asymptotically normal, which undermines the validity of inference.
2. **Design complexity of covariate-adaptive randomization**:
- Covariate-adaptive randomization improves the efficiency of estimation and inference by balancing baseline covariates (such as gender or age) across treatment groups. However, it also induces a complex dependence structure among treatment assignments, so traditional methods cannot be applied directly.
3. **New estimation methods**:
- The authors study an M-estimator based on the estimated influence function and prove its asymptotic properties under covariate-adaptive randomization. In addition, they introduce a stratified transformed difference-in-means estimator to improve efficiency and establish its consistency and asymptotic normality; its asymptotic variance does not depend on the randomization method.
4. **Improvement in variance estimation**:
- To overcome the limitations of the existing variance estimator, the authors propose a nonparametric variance estimator that is consistent under all commonly used covariate-adaptive randomization methods and is suitable for constructing confidence intervals and conducting hypothesis tests.
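
The design and the stratified estimator described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses stratified permuted-block randomization as one example of a covariate-adaptive design, and a plain stratum-weighted difference in means. The summary does not specify the transformation the paper applies to the outcomes, so the sketch accepts an arbitrary user-supplied `transform` (identity by default). The function names `stratified_block_randomize` and `stratified_diff_in_means` are hypothetical.

```python
import random
from collections import defaultdict
from statistics import fmean


def stratified_block_randomize(strata, block_size=4, seed=0):
    """Assign 0/1 treatment within each stratum using permuted blocks."""
    rng = random.Random(seed)
    pending = defaultdict(list)  # unused assignments of the current block, per stratum
    assignment = []
    for s in strata:
        if not pending[s]:
            # Start a new balanced block for this stratum and shuffle it.
            block = [1] * (block_size // 2) + [0] * (block_size // 2)
            rng.shuffle(block)
            pending[s] = block
        assignment.append(pending[s].pop())
    return assignment


def stratified_diff_in_means(y, a, strata, transform=lambda v: v):
    """Weight each within-stratum difference in means by the stratum's sample share.

    `transform` is a placeholder for an outcome transformation (assumption).
    """
    groups = defaultdict(lambda: ([], []))  # stratum -> (control values, treated values)
    for yi, ai, si in zip(y, a, strata):
        groups[si][ai].append(transform(yi))
    n = len(y)
    estimate = 0.0
    for control, treated in groups.values():
        if not control or not treated:
            raise ValueError("each stratum needs units in both arms")
        weight = (len(control) + len(treated)) / n
        estimate += weight * (fmean(treated) - fmean(control))
    return estimate


# Toy data: two strata, a stratum shift of 1.0, and a true effect of 2.0.
strata = ["a"] * 8 + ["b"] * 8
a = stratified_block_randomize(strata)
y = [2.0 * ai + (1.0 if s == "b" else 0.0) for ai, s in zip(a, strata)]
estimate = stratified_diff_in_means(y, a, strata)
print(estimate)
```

Because the comparison is made within strata, the stratum-level shift in the toy outcomes cancels out of the estimate. Valid inference for such an estimator still needs a variance estimator that accounts for the dependence the design introduces, which this sketch does not attempt.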
### Summary
This paper addresses the challenges posed by heavy-tailed outcomes under covariate-adaptive randomization by introducing a new estimator and a universally applicable variance estimator, improving the efficiency of treatment-effect estimation and the validity of the resulting inference. Numerical results support the effectiveness of these methods in finite samples.