Abstract: The Ewens-Pitman model is a distribution for random partitions of the set $\{1,\ldots,n\}$, with $n\in\mathbb{N}$, indexed by parameters $\alpha \in [0,1)$ and $\theta>-\alpha$, such that $\alpha=0$ is the Ewens model in population genetics. The large $n$ asymptotic behaviour of the number $K_{n}$ of blocks in the Ewens-Pitman random partition has been extensively investigated in terms of almost-sure and Gaussian fluctuations, which show that $K_{n}$ scales as $\log n$ and $n^{\alpha}$ depending on whether $\alpha=0$ or $\alpha\in(0,1)$, providing non-random and random limiting behaviours, respectively. In this paper, we study the large $n$ asymptotic behaviour of $K_{n}$ when the parameter $\theta$ is allowed to depend linearly on $n\in\mathbb{N}$, a non-standard asymptotic regime first considered for $\alpha=0$ in Feng (\textit{The Annals of Applied Probability}, \textbf{17}, 2007). In particular, for $\alpha\in[0,1)$ and $\theta=\lambda n$, with $\lambda>0$, we establish a law of large numbers (LLN) and a central limit theorem (CLT) for $K_{n}$, which show that $K_{n}$ scales as $n$, providing non-random limiting behaviours. Depending on whether $\alpha=0$ or $\alpha\in(0,1)$, our results rely on different arguments. For $\alpha=0$ we rely on the representation of $K_{n}$ as a sum of independent, but not identically distributed, Bernoulli random variables, which leads to a refinement of the CLT in terms of a Berry-Esseen theorem. Instead, for $\alpha\in(0,1)$, we rely on a compound Poisson construction of $K_{n}$, leading to prove LLNs, CLTs and Berry-Esseen theorems for the number of blocks of the negative-Binomial compound Poisson random partition, which are of independent interest.
What problem does this paper attempt to address?
This paper addresses the large-$n$ asymptotic behavior of the number of blocks $K_n$ in a random partition under the Ewens-Pitman model when the parameter $\theta$ depends linearly on $n$ (i.e., $\theta = \lambda n$, where $\lambda>0$). Specifically, for $\alpha\in[0,1)$ the authors study the mean and variance of $K_n$ as $n\to\infty$, and establish a law of large numbers (LLN) and a central limit theorem (CLT) for $K_n$. In this non-standard asymptotic regime $K_n$ scales as $n$, rather than as $\log n$ (for $\alpha=0$) or $n^{\alpha}$ (for $\alpha\in(0,1)$) as in the fixed-$\theta$ case, and its limiting behaviour is non-random.
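The linear scaling can be checked numerically with a short sketch. It is only an illustration and rests on two textbook facts that this summary does not state explicitly, so treat them as assumptions: for $\alpha=0$, $K_n$ is a sum of independent Bernoulli variables with success probabilities $\theta/(\theta+i-1)$, $i=1,\ldots,n$ (the Chinese restaurant construction), and for $\alpha\in(0,1)$ the mean is $E[K_n]=\frac{\theta}{\alpha}\left[\frac{(\theta+\alpha)_n}{(\theta)_n}-1\right]$, with $(x)_n$ the rising factorial. The function names are hypothetical. The limiting constants are the $m_{\alpha,\lambda}$ and $s^2_{0,\lambda}$ listed under the main contributions.

```python
import math


def exact_moments_ewens(n, lam):
    """Exact mean and variance of K_n for alpha = 0 and theta = lam * n.

    Assumes the Bernoulli representation K_n = B_1 + ... + B_n with
    independent B_i ~ Bernoulli(theta / (theta + i - 1)).
    """
    theta = lam * n
    mean = 0.0
    var = 0.0
    for i in range(1, n + 1):
        p = theta / (theta + i - 1)  # probability that item i opens a new block
        mean += p
        var += p * (1.0 - p)
    return mean, var


def exact_mean_pitman(n, alpha, lam):
    """Exact mean of K_n for alpha in (0, 1) and theta = lam * n.

    Assumes E[K_n] = (theta/alpha) * ((theta+alpha)_n / (theta)_n - 1),
    evaluated through log-gamma differences for numerical stability.
    """
    theta = lam * n
    log_ratio = (math.lgamma(theta + alpha + n) - math.lgamma(theta + alpha)
                 - math.lgamma(theta + n) + math.lgamma(theta))
    return (theta / alpha) * (math.exp(log_ratio) - 1.0)


def m_limit(alpha, lam):
    """Limiting mean constant m_{alpha, lambda}."""
    if alpha == 0:
        return lam * math.log(1.0 + 1.0 / lam)
    return (lam / alpha) * ((1.0 + 1.0 / lam) ** alpha - 1.0)


def s2_limit_ewens(lam):
    """Limiting variance constant s^2_{0, lambda} (alpha = 0 case only)."""
    return lam * math.log(1.0 + 1.0 / lam) - lam / (lam + 1.0)


if __name__ == "__main__":
    lam = 1.0
    for n in (100, 1000, 10000):
        mean, var = exact_moments_ewens(n, lam)
        mean_half = exact_mean_pitman(n, 0.5, lam)
        # The differences should stay bounded as n grows (the O(1) remainder).
        print(n,
              mean - n * m_limit(0, lam),
              var - n * s2_limit_ewens(lam),
              mean_half - n * m_limit(0.5, lam))
```

If the assumptions hold, the three printed differences should stay bounded as $n$ increases, in line with the $O(1)$ remainders in the expansions of $E[K_n]$ and $\text{Var}(K_n)$.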
### Main contributions
1. **Asymptotic expansions**: For $\alpha\in[0,1)$ and $\theta=\lambda n$, the paper derives asymptotic expansions of the expectation and variance of $K_n$ as $n\to\infty$:
\[
E[K_n]=n m_{\alpha,\lambda}+O(1)
\]
\[
\text{Var}(K_n)=n s^2_{\alpha,\lambda}+O(1)
\]
where
\[
m_{\alpha,\lambda}=\begin{cases}
\frac{\lambda}{\alpha}\left[\left(1 + \frac{1}{\lambda}\right)^\alpha - 1\right] & \text{if }\alpha\in(0,1)\\
\lambda\log\left(1 + \frac{1}{\lambda}\right) & \text{if }\alpha = 0
\end{cases}
\]
\[
s^2_{\alpha,\lambda}=\begin{cases}
\frac{\lambda}{\alpha}\left[\left(1 + \frac{1}{\lambda}\right)^{2\alpha}\left(1-\frac{\alpha}{1+\lambda}\right)-\left(1 + \frac{1}{\lambda}\right)^\alpha\right] & \text{if }\alpha\in(0,1)\\
\lambda\log\left(1 + \frac{1}{\lambda}\right)-\frac{\lambda}{\lambda + 1} & \text{if }\alpha = 0
\end{cases}
\]
2. **Law of large numbers (LLN)**: It is proved that
\[
\frac{K_n}{n}\overset{p}{\to}m_{\alpha,\lambda}
\]
This means that the proportion $K_n/n$ converges in probability to the non-random constant $m_{\alpha,\lambda}$.
3. **Central limit theorem (CLT)**: It is proved that
\[
\frac{K_n - n m_{\alpha,\lambda}}{\sqrt{n s^2_{\alpha,\lambda}}}\overset{d}{\to}N(0,1)
\]
That is, the standardized form of $K_n$ converges in distribution to the standard normal distribution.
4. **Berry-Esseen theorem**: For the case $\alpha = 0$, by representing $K_n$ as a sum of independent but not identically distributed Bernoulli random variables, a refinement of the CLT, namely a Berry-Esseen inequality, is obtained:
\[
\|F_n-\Phi\|_\infty\leq C\frac{\log(n)}{n^{1/8}}
\]
where $F_n$ is $\frac{K_n - n$