Inferring language dispersal patterns with velocity field estimation

Sizhe Yang, Xiaoru Sun, Li Jin, Menghan Zhang
DOI: https://doi.org/10.1038/s41467-023-44430-5
IF: 16.6
2024-01-02
Nature Communications
Abstract: Reconstructing the spatial evolution of languages can deepen our understanding of the demic diffusion and cultural spread. However, the phylogeographic approach that is frequently used to infer language dispersal patterns has limitations, primarily because the phylogenetic tree cannot fully explain the language evolution induced by the horizontal contact among languages, such as borrowing and areal diffusion. Here, we introduce the language velocity field estimation, which does not rely on the phylogenetic tree, to infer language dispersal trajectories and centre. Its effectiveness and robustness are verified through both simulated and empirical validations. Using language velocity field estimation, we infer the dispersal patterns of four agricultural language families and groups, encompassing approximately 700 language samples. Our results show that the dispersal trajectories of these languages are primarily compatible with population movement routes inferred from ancient DNA and archaeological materials, and their dispersal centres are geographically proximate to ancient homelands of agricultural or Neolithic cultures. Our findings highlight that the agricultural languages dispersed alongside the demic diffusions and cultural spreads during the past 10,000 years. We expect that language velocity field estimation could aid the spatial analysis of language evolution and further branch out into the studies of demographic and cultural dynamics.
multidisciplinary sciences
### What problems does this paper attempt to solve?

This paper addresses several key problems in reconstructing language dispersal patterns, in particular the limitations of the traditional phylogeographic approach. Specifically:

1. **Limitations of traditional methods**:
   - The phylogeographic approach relies mainly on constructing a phylogenetic tree of languages. A tree represents vertical evolutionary relationships between languages well (such as branching and divergence), but it cannot fully explain language evolution caused by horizontal contact, such as borrowing and areal diffusion.
   - As a result, the spatial evolution of languages remains incompletely understood, especially where languages interact in complex ways in multilingual regions.
2. **Proposed new method**:
   - The paper introduces a new computational method, **language velocity field estimation (LVF)**. It does not rely on a phylogenetic tree; instead, it infers language dispersal trajectories and centers by estimating a velocity field.
   - The velocity field captures the diachronic evolution of language features and also reflects the influence of horizontal contact, giving a more complete description of the spatio-temporal evolution of languages.
3. **Verification and application**:
   - The researchers verified the effectiveness and robustness of LVF on both simulated and empirical data.
   - In the empirical application, LVF was used to infer the dispersal patterns of four agricultural language families and groups (Indo-European, Sino-Tibetan, Bantu, and Arawak), covering approximately 700 language samples, and to examine how consistent these dispersals are with ancient population movements and cultural spreads.
4. **Research significance**:
   - The method helps deepen our understanding of human population movements and cultural spreads over the past 10,000 years, especially how agricultural languages dispersed alongside them.
   - LVF offers a new tool for analyzing language evolution and its historical connection with human activity, which supports interdisciplinary research.

Through this work, the paper aims to provide a more comprehensive and accurate method for studying the spatial evolution of languages, and so to better understand the complex interactions between languages, cultures, and populations.