Laws of large numbers and central limit theorem for Ewens-Pitman model

Claudia Contardi,Emanuele Dolera,Stefano Favaro
2024-12-16
Abstract:The Ewens-Pitman model is a distribution for random partitions of the set $\{1,\ldots,n\}$, with $n\in\mathbb{N}$, indexed by parameters $\alpha \in [0,1)$ and $\theta>-\alpha$, such that $\alpha=0$ is the Ewens model in population genetics. The large $n$ asymptotic behaviour of the number $K_{n}$ of blocks in the Ewens-Pitman random partition has been extensively investigated in terms of almost-sure and Gaussian fluctuations, which show that $K_{n}$ scales as $\log n$ and $n^{\alpha}$ depending on whether $\alpha=0$ or $\alpha\in(0,1)$, providing non-random and random limiting behaviours, respectively. In this paper, we study the large $n$ asymptotic behaviour of $K_{n}$ when the parameter $\theta$ is allowed to depend linearly on $n\in\mathbb{N}$, a non-standard asymptotic regime first considered for $\alpha=0$ in Feng (\textit{The Annals of Applied Probability}, \textbf{17}, 2007). In particular, for $\alpha\in[0,1)$ and $\theta=\lambda n$, with $\lambda>0$, we establish a law of large numbers (LLN) and a central limit theorem (CLT) for $K_{n}$, which show that $K_{n}$ scales as $n$, providing non-random limiting behaviours. Depending on whether $\alpha=0$ or $\alpha\in(0,1)$, our results rely on different arguments. For $\alpha=0$ we rely on the representation of $K_{n}$ as a sum of independent, but not identically distributed, Bernoulli random variables, which leads to a refinement of the CLT in terms of a Berry-Esseen theorem. Instead, for $\alpha\in(0,1)$, we rely on a compound Poisson construction of $K_{n}$, leading to prove LLNs, CLTs and Berry-Esseen theorems for the number of blocks of the negative-Binomial compound Poisson random partition, which are of independent interest.
Probability
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the large - n asymptotic behavior of the number of blocks $K_n$ in a random partition in the Ewens - Pitman model when the parameter $\theta$ depends linearly on $n$ (i.e., $\theta = \lambda n$, where $\lambda>0$). Specifically, the authors study the behavior of the mean and variance of $K_n$ as $n$ approaches infinity for $\alpha\in[0,1)$, and establish the strong law of large numbers (LLN) and the central limit theorem (CLT) for $K_n$. The study of these problems helps to understand the statistical properties of the number of random blocks in a non - standard asymptotic regime. ### Main contributions 1. **Asymptotic expansion**: For $\alpha\in[0,1)$, the paper derives the asymptotic expansions of the expectation and variance of $K_n$. \[ E[K_n]=n m_{\alpha,\lambda}+O(1) \] \[ \text{Var}(K_n)=n s^2_{\alpha,\lambda}+O(1) \] where \[ m_{\alpha,\lambda}=\begin{cases} \frac{\lambda}{\alpha}\left[\left(1 + \frac{1}{\lambda}\right)^\alpha - 1\right] & \text{if }\alpha\in(0,1)\\ \lambda\log\left(1 + \frac{1}{\lambda}\right) & \text{if }\alpha = 0 \end{cases} \] \[ s^2_{\alpha,\lambda}=\begin{cases} \frac{\lambda}{\alpha}\left[\left(1 + \frac{1}{\lambda}\right)^{2\alpha}\left(\frac{1-\alpha}{1+\lambda}\right)-\left(1 + \frac{1}{\lambda}\right)^\alpha\right] & \text{if }\alpha\in(0,1)\\ \lambda\log\left(1 + \frac{1}{\lambda}\right)-\frac{\lambda}{\lambda + 1} & \text{if }\alpha = 0 \end{cases} \] 2. **Strong law of large numbers (LLN)**: It is proved that \[ \frac{K_n}{n}\overset{p}{\to}m_{\alpha,\lambda} \] This means that the mean of $K_n$ converges in probability to $m_{\alpha,\lambda}$. 3. **Central limit theorem (CLT)**: It is proved that \[ \frac{K_n - n m_{\alpha,\lambda}}{\sqrt{n s^2_{\alpha,\lambda}}}\overset{d}{\to}N(0,1) \] That is, the standardized form of $K_n$ converges in distribution to the standard normal distribution. 4. **Berry - Esseen theorem**: For the case of $\alpha = 0$, by representing $K_n$ as independent but not identically distributed Bernoulli random variables, a more refined CLT, namely the Berry - Esseen inequality, is obtained: \[ \|F_n-\Phi\|_\infty\leq C\frac{\log(n)}{n^{1/8}} \] where $F_n$ is $\frac{K_n - n$