The Critical Beta-splitting Random Tree IV: Mellin analysis of Leaf Height

David Aldous,Svante Janson
2024-12-17
Abstract:In the critical beta-splitting model of a random $n$-leaf rooted tree, clades are recursively split into sub-clades, and a clade of $m$ leaves is split into sub-clades containing $i$ and $m-i$ leaves with probabilities $\propto 1/(i(m-i))$. The height of a uniform random leaf can be represented as the absorption time of a certain {\em harmonic descent} Markov chain. Recent work on these heights $D_n$ and $L_n$ (corresponding to discrete or continuous versions of the tree) has led to quite sharp expressions for their asymptotic distributions, based on their Markov chain description. This article gives even sharper expressions, based on an $n \to \infty$ limit tree structure described via exchangeable random partitions in the style of Haas et al (2008). Within this structure, calculations of moments lead to expressions for Mellin transforms, and then via Mellin inversion we obtain sharp estimates for the expectation, variance, Normal approximation and large deviation behavior of $D_n$.
Probability,Complex Variables
What problem does this paper attempt to address?
The problem that this paper attempts to solve is about the exact asymptotic analysis of leaf heights in the random tree model. Specifically, the author studies the leaf height distribution in the critical beta - splitting random tree model, especially obtaining more accurate asymptotic expressions through the Mellin transform technique. ### Main problem description In the critical beta - splitting model, a random tree with \(n\) leaves is constructed by recursively splitting clades into sub - clades. Each clade containing \(m\) leaves is split into two sub - clades containing \(i\) and \(m - i\) leaves with probability \(\propto\frac{1}{i(m - i)}\). The height of a uniformly randomly selected leaf in the tree can be represented as the absorption time of a certain harmonic descent Markov chain. Recent studies have given relatively accurate expressions for the asymptotic distributions of these heights, but this paper aims to provide more accurate expressions. ### Key problems 1. **Expected value of leaf height**: - Leaf jump height \(D_{n}\) in the discrete - time model (DTCS). - Leaf height \(L_{n}\) in the continuous - time model (CTCS). 2. **Higher - order moments**: - Higher - order moments of \(D_{n}\), such as variance, central limit theorem and large - deviation behavior. 3. **Asymptotic properties of tree structure**: - Limit tree structure as \(n\rightarrow\infty\) described by exchangeable random partitions. - Calculate moments using the Mellin transform and derive asymptotic estimates. ### Specific problems and solutions #### 1. Expected value of leaf height - For the discrete - time model (DTCS), the expected value of the leaf jump height \(D_{n}\) has the following asymptotic expansion: \[ E[D_{n}]\sim\frac{6}{\pi^{2}}\log n+\sum_{i = 0}^{\infty}c_{i}n^{-i}+\sum_{j = 1}^{\infty}\sum_{k = 1}^{\infty}c_{j,k}n^{-|s_{j}|-k} \] where \(c_{0}=\frac{\zeta(3)}{\zeta(2)^{2}}+\frac{\gamma}{\zeta(2)}\approx0.795155660439\), \(c_{1}=-\frac{3}{\pi^{2}}\) - For the continuous - time model (CTCS), the expected value of the leaf height \(L_{n}\) has the following asymptotic expansion: \[ E[L_{n}]\sim\frac{3}{\pi^{2}}(\log n)^{2}+\left(\frac{\zeta(3)}{\zeta(2)^{2}}+\frac{\gamma}{\zeta(2)}\right)\log n + b_{0}+\sum_{k = 1}^{\infty}a_{k}n^{-k}\log n+\sum_{k = 1}^{\infty}b_{k}n^{-k}+\sum_{j = 1}^{\infty}\sum_{k = 1}^{\infty}c_{j,k}n^{-|s_{j}|-k} \] where \(b_{0}=\frac{3\gamma^{2}}{\pi^{2}}+\frac{\zeta(3)\gamma}{\zeta(2)^{2}}+\frac{\zeta(3)^{2}}{\zeta(2)^{3}}+\frac{1}{10}\approx0.78234\) #### 2. Higher - order moments - The variance of \(D_{n}\) has the following asymptotic expansion: \[ \text{var}(D_{n})=\frac{2\zeta(3)}{\zeta(2)^{3}}\log n+\frac{2\zeta(3)}{\zeta(2