Abstract:Under certain general conditions, we prove that the stable central limit theorem holds in the total variation distance and get its optimal convergence rate for all $\alpha \in (0,2)$. Our method is by two measure decompositions, one step estimates, and a very delicate induction with respect to $\alpha$. One measure decomposition is light tailed and borrowed from \cite{BC16}, while the other one is heavy tailed and indispensable for lifting convergence rate for small $\alpha$. The proof is elementary and composed of the ingredients at the postgraduate level. Our result clarifies that when $\alpha=1$ and $X$ has a symmetric Pareto distribution, the optimal rate is $n^{-1}$ rather than $n^{-1} (\ln n)^2$ as conjectured in literatures.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is whether the Stable Central Limit Theorem (CLT) holds under the total variation distance and its convergence rate. Specifically, the authors prove in the paper that under certain general conditions, for heavy - tailed random variables (i.e., random variables without the second moment), the stable CLT holds under the total variation distance, and the optimal convergence rate for all \(\alpha\in(0, 2)\) is obtained. In addition, they further prove that this rate is optimal by showing the lower bound of symmetric Pareto - distributed random variables. In particular, when \(\alpha = 1\) and \(X\) has a symmetric Pareto distribution, the optimal rate is \(n^{- 1}\), rather than \(n^{-1}(\ln n)^2\) as conjectured in the literature.
The main contributions of the paper are as follows:
- For the first time, it proves the existence of the stable CLT and its optimal convergence rate under the total variation distance.
- For \(\alpha = 1\) and \(X\) with a symmetric Pareto distribution, it clarifies that the optimal convergence rate is \(n^{-1}\).
- It proposes a new method, including two - measure decomposition, one - step estimation and a very fine induction method on \(\alpha\), which is particularly effective in dealing with heavy - tailed random variables.
These results are of great significance for understanding and analyzing the statistical properties of heavy - tailed data, especially in high - dimensional cases. The total variation distance is stronger than the Kolmogorov distance and thus more difficult to handle. The methods and techniques in the paper provide new tools and perspectives for future research.