How mutation accumulation depends on the structure of the cell lineage tree

Imre Derényi, Márton C. Demeter, Mario Pérez-Jiménez, Dániel Grajzel, Gergely J. Szöllősi
DOI: https://doi.org/10.1103/physreve.109.044407
IF: 2.707
2024-04-13
Physical Review E
Abstract: All the cells of a multicellular organism are the product of cell divisions that trace out a single binary tree, the so-called cell lineage tree. Because cell divisions are accompanied by replication errors, the shape of the cell lineage tree is a key determinant of how somatic evolution, which can potentially lead to cancer, proceeds. Carcinogenesis requires the accumulation of a certain number of driver mutations. By mapping the accumulation of mutations into a graph theoretical problem, we present an exact numerical method to calculate the probability of collecting a given number of mutations and show that for low mutation rates it can be approximated with a simple analytical formula, which depends only on the distribution of the lineage lengths, and is dominated by the longest lineages. Our results are crucial in understanding how natural selection can shape the cell lineage trees of multicellular organisms and curtail somatic evolution. Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.
physics, fluids & plasmas, mathematical
What problem does this paper attempt to address?
The paper examines how the structure of the cell lineage tree affects the accumulation of mutations, in particular the driver mutations required for the onset of cancer. Its core contribution is an exact numerical method for calculating the probability that a specific number of mutations accumulates on a given cell lineage tree, together with a simple analytical approximation that holds for low mutation rates. The paper starts from the observation that all cells of a multicellular organism are produced by a series of cell divisions that form a binary tree, the cell lineage tree. Because cell divisions are accompanied by replication errors, the shape of this tree is a key determinant of how mutations accumulate, and therefore of the pace and pattern of somatic evolution, which can lead to cancer. Carcinogenesis requires a certain number of driver mutations, and the shape of the tree affects the probability that they accumulate. By mapping mutation accumulation onto a graph-theoretical problem, the paper obtains an exact method for calculating this probability. For low mutation rates, the probability can be approximated by a simple formula that depends only on the distribution of lineage lengths and is dominated by the longest lineages. This result matters for understanding how natural selection can shape the cell lineage trees of multicellular organisms and limit somatic evolution. The paper also discusses cancer as a disease that arises from a series of genetic and epigenetic changes, with somatic cells beginning to proliferate uncontrollably once they have accumulated enough driver mutations.
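To see why long lineages matter so much at low mutation rates, it can help to look at a single lineage in isolation. The sketch below is not the paper's formula. It assumes a deliberately simple model in which each cell division along a lineage independently adds one driver mutation with probability `mu`, so the number of mutations after `length` divisions is binomial. The names `prob_at_least` and `leading_order` are hypothetical helpers introduced only for this illustration.

```python
from math import comb


def prob_at_least(m, length, mu):
    """P(at least m mutations) along one lineage of `length` divisions.

    Assumption (not taken from the paper): every division independently
    adds a mutation with probability mu, so the count is binomial.
    """
    below = sum(
        comb(length, k) * mu**k * (1 - mu) ** (length - k) for k in range(m)
    )
    return 1.0 - below


def leading_order(m, length, mu):
    """Small-mu approximation: choose which m divisions carry a mutation."""
    return comb(length, m) * mu**m


mu, m = 1e-3, 3
for length in (10, 20, 40):
    exact = prob_at_least(m, length, mu)
    approx = leading_order(m, length, mu)
    print(f"length={length:3d}  exact={exact:.3e}  leading order={approx:.3e}")
```

Under this assumed model the leading term grows like `length**m`, so doubling the lineage length from 20 to 40 raises the probability of collecting three mutations by roughly a factor of eight. That steep growth is one way to read the statement that the longest lineages dominate.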
The researchers assume that, apart from harmful mutations that terminate the corresponding lineage, mutations do not affect cell proliferation dynamics until a critical number of driver mutations has accumulated. Although this assumption may not hold in some cases, it is consistent with the observation that most cancers do not have a noticeable precancerous stage, and with recent analyses suggesting that driver mutations often occur years or even decades before diagnosis. The probability of accumulating a fixed number of mutations on a generic cell lineage tree is calculated by treating the construction of the tree as a graph-theoretical process: starting from the leaf nodes, subtrees are merged step by step until the entire tree is assembled. Iterating the merging rules yields the probability that at least m mutations occur on any given tree. For low mutation rates, the researchers give a simplified approximation in which the probability is estimated from a sum over the leaf nodes that depends only on their lineage lengths, with the longest lineages playing the dominant role. This formula shows why minimizing the longest lineage length (e.g., through hierarchical differentiation) is crucial for limiting somatic evolution, and with it the risk of cancer. In summary, the paper's main results are: first, an iterative method based on subtree merging for exactly calculating the probability of accumulating a given number of mutations on any tree; second, for low mutation rates, a simple analytical formula that approximates this probability, depends only on the distribution of leaf lineage lengths, and is dominated by the longest lineages.
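The bottom-up idea of merging subtrees can be illustrated with a small recursive calculation. The sketch below is not the paper's algorithm or its merging rules. It assumes that every branch of the tree independently carries one mutation with probability `mu`, and that mutations have no effect on the tree itself. The tree encoding (an empty tuple for a leaf, a tuple of children for an internal node) and the names `no_leaf_reaches`, `prob_at_least_m`, `balanced`, and `caterpillar` are hypothetical choices made for this example.

```python
def no_leaf_reaches(tree, m, mu):
    """Return g with g[k] = P(no leaf below `tree` has k or more mutations
    on its path from the root of `tree`), for k = 0..m.

    Assumption (not from the paper): each branch independently carries
    one mutation with probability mu.
    """
    if not tree:  # a leaf: zero mutations, which is below k for every k >= 1
        return [0.0] + [1.0] * m
    g = [1.0] * (m + 1)
    for child in tree:
        c = no_leaf_reaches(child, m, mu)
        # Extend the child's result by the branch that leads to it:
        # either the branch is clean, or it uses up one of the k mutations.
        h = [
            (1 - mu) * c[k] + mu * (c[k - 1] if k > 0 else 0.0)
            for k in range(m + 1)
        ]
        # Branches below different children are disjoint, hence independent.
        g = [a * b for a, b in zip(g, h)]
    return g


def prob_at_least_m(tree, m, mu):
    """P(at least one leaf accumulates m or more mutations)."""
    return 1.0 - no_leaf_reaches(tree, m, mu)[m]


def balanced(depth):
    """Fully balanced binary tree with 2**depth leaves."""
    return () if depth == 0 else (balanced(depth - 1), balanced(depth - 1))


def caterpillar(n_leaves):
    """Maximally unbalanced binary tree: one long spine with side leaves."""
    tree = ()
    for _ in range(n_leaves - 1):
        tree = ((), tree)
    return tree


mu, m = 1e-3, 3
print("balanced   :", prob_at_least_m(balanced(3), m, mu))
print("caterpillar:", prob_at_least_m(caterpillar(8), m, mu))
```

Both example trees have eight leaves and fourteen branches, but the longest lineage has three divisions in the balanced tree and seven in the caterpillar. Under the assumed model the caterpillar gives a markedly higher probability of some cell collecting three mutations, in line with the point that shortening the longest lineages limits mutation accumulation.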
These results underline the importance of minimizing the longest lineage length to reduce the risk of cancer, and they provide theoretical tools for understanding and quantifying cancer susceptibility. As single-cell technologies develop, the results could be applied directly to cell lineage data at the tissue or individual level to better understand the biological basis of cancer risk.