The $B_2$ index of galled trees

François Bienvenu,Jean-Jil Duchamps,Michael Fuchs,Tsan-Cheng Yu
2024-07-28
Abstract:In recent years, there has been an effort to extend the classical notion of phylogenetic balance, originally defined in the context of trees, to networks. One of the most natural ways to do this is with the so-called $B_2$ index. In this paper, we study the $B_2$ index for a prominent class of phylogenetic networks: galled trees. We show that the $B_2$ index of a uniform leaf-labeled galled tree converges in distribution as the network becomes large. We characterize the corresponding limiting distribution, and show that its expected value is 2.707911858984... This is the first time that a balance index has been studied to this level of detail for a random phylogenetic network. One specificity of this work is that we use two different and independent approaches, each with its advantages: analytic combinatorics, and local limits. The analytic combinatorics approach is more direct, as it relies on standard tools; but it involves slightly more complex calculations. Because it has not previously been used to study such questions, the local limit approach requires developing an extensive framework beforehand; however, this framework is interesting in itself and can be used to tackle other similar problems.
Populations and Evolution,Combinatorics,Probability
What problem does this paper attempt to address?
The paper primarily focuses on studying the statistical behavior of a metric called the B2 index on a specific type of evolutionary network known as "galled trees." The B2 index was originally defined as a balance index in phylogenetic trees to quantify the degree of symmetry in the tree. As phylogenetic networks become increasingly important in describing evolutionary history, extending the original balance index to these networks has become a significant research direction. Specifically, the main contributions of the paper include: 1. **Study of the B2 Index**: - The paper is the first to thoroughly investigate the distribution of the B2 index on random evolutionary networks. - For uniform leaf-labeled galled trees, it is proven that the distribution of the B2 index converges as the network size increases, and the characteristics of the limiting distribution are provided. - In particular, the expected value converges to a constant \(c = 2.707911858984...\). 2. **Two Different Research Methods**: - **Analytic Combinatorics**: This method is more straightforward but requires complex calculations. - **Local Limit Method**: This method requires establishing a substantial theoretical framework but provides a general approach to solving such problems, with relatively simple numerical calculations. 3. **Methodological Innovations**: - The study uses two completely independent methods to verify the consistency of the results, each with its own advantages. - The analytic combinatorics method utilizes existing mathematical tools, while the local limit method requires the development of new theoretical frameworks, but the latter can be applied to other similar problems. In summary, this paper addresses the specific problem of calculating the mathematical expectation of the B2 index in random evolutionary networks and demonstrates two complementary methods to tackle such problems. Additionally, it lays the groundwork for studying the behavior of other balance indices in evolutionary networks.