Jolien Cremers, Benjamin Kohler, Benjamin Frank Maier, Stine Nymann Eriksen, Johanna Einsiedler, Frederik Kølby Christensen, Sune Lehmann, David Dreyer Lassen, Laust Hvas Mortensen, Andreas Bjerre-Nielsen
Abstract: Social networks shape individuals' lives, influencing everything from career paths to health. This paper presents a registry-based, multi-layer and temporal network of the entire Danish population in the years 2008-2021 (roughly 7.2 mill. individuals). Our network maps the relationships formed through family, households, neighborhoods, colleagues and classmates. We outline key properties of this multiplex network, introducing both an individual-focused perspective as well as a bipartite representation. We show how to aggregate and combine the layers, and how to efficiently compute network measures such as shortest paths in large administrative networks. Our analysis reveals how past connections reappear later in other layers, that the number of relationships aggregated over time reflects the position in the income distribution, and that we can recover canonical shortest path length distributions when appropriately weighting connections. Along with the network data, we release a Python package that uses the bipartite network representation for efficient analysis.
What problem does this paper attempt to address?
This paper attempts to solve the following key problems:
1. **Introducing the time dimension to capture the dynamic changes of social networks**:
- Traditional social network research usually relies on data from a single point in time, which ignores how individual relationships evolve. By introducing the concept of a time span, the paper integrates multi-year data (2008-2021) into a unified framework, so that long-term changes in social relationships can be tracked and analyzed.
- The time span reveals differences in stability across relationship types (for example, family relationships are more stable than colleague relationships), and it shows how many social connections individuals accumulate over time and how those counts are distributed.
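
The time-span idea can be illustrated with a minimal sketch. It assumes the data arrives as one undirected edge list per year; the names `yearly_edges`, `aggregate_edges`, and `cumulative_degree` and the toy ties are hypothetical and are not taken from the paper or its released package.

```python
from collections import defaultdict

# Hypothetical toy data: year -> set of undirected ties between person IDs.
yearly_edges = {
    2008: {("a", "b"), ("a", "c")},
    2009: {("a", "b"), ("b", "d")},
    2010: {("b", "a"), ("a", "e")},
}


def aggregate_edges(yearly_edges, start, end):
    """Count in how many years of [start, end] each undirected tie is present."""
    years_active = defaultdict(int)
    for year in range(start, end + 1):
        # Normalise each tie so that (a, b) and (b, a) count as the same edge.
        ties = {tuple(sorted(edge)) for edge in yearly_edges.get(year, ())}
        for tie in ties:
            years_active[tie] += 1
    return dict(years_active)


def cumulative_degree(aggregated):
    """Number of distinct contacts per person over the aggregated time span."""
    neighbors = defaultdict(set)
    for u, v in aggregated:
        neighbors[u].add(v)
        neighbors[v].add(u)
    return {person: len(contacts) for person, contacts in neighbors.items()}


span = aggregate_edges(yearly_edges, 2008, 2010)
degrees = cumulative_degree(span)
```

In this sketch the per-tie year count serves as a rough proxy for stability (a tie present in every year is more stable than one seen once), while `cumulative_degree` grows as the time span widens.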
2. **Using dual perspectives to optimize the network representation of large-scale administrative data**:
- The paper proposes two views for constructing the network: a unipartite view and a bipartite view. The former directly describes relationships between individuals, while the latter links individuals to specific social settings (such as workplaces and school classes) through "container nodes".
- The bipartite representation significantly reduces the number of edges and improves computational efficiency, especially for layers with large groups (such as the colleague and classmate layers): a group of k members needs only k person-to-container edges instead of k(k-1)/2 pairwise edges. This makes operations such as shortest-path calculation more efficient on networks with millions of nodes.
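
To make the edge-count argument concrete, the following sketch compares both views on toy data and runs a breadth-first search over the bipartite graph. It is an illustration under stated assumptions, not the API of the released package: the `memberships` mapping and all function names are hypothetical, and edges are unweighted (the paper additionally weights connections).

```python
from collections import defaultdict, deque
from itertools import combinations

# Hypothetical toy data: container node (workplace, class, ...) -> members.
memberships = {
    "workplace_1": ["a", "b", "c", "d"],
    "class_1": ["d", "e", "f"],
}


def unipartite_edge_count(memberships):
    """Distinct person-person edges when every container becomes a clique."""
    edges = set()
    for members in memberships.values():
        edges.update(combinations(sorted(set(members)), 2))
    return len(edges)


def bipartite_edge_count(memberships):
    """One person-container edge per membership."""
    return sum(len(set(members)) for members in memberships.values())


def build_bipartite(memberships):
    """Adjacency sets over tagged nodes: ('person', id) and ('container', id)."""
    adjacency = defaultdict(set)
    for container, members in memberships.items():
        for person in members:
            adjacency[("person", person)].add(("container", container))
            adjacency[("container", container)].add(("person", person))
    return adjacency


def person_distance(adjacency, source, target):
    """Unweighted person-to-person distance via BFS on the bipartite graph.

    Each person-person step crosses one container, i.e. two bipartite hops,
    so the bipartite hop count is halved. Returns None if unreachable.
    """
    start, goal = ("person", source), ("person", target)
    if start not in adjacency or goal not in adjacency:
        return None
    hops = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            return hops[node] // 2
        for neighbor in adjacency[node]:
            if neighbor not in hops:
                hops[neighbor] = hops[node] + 1
                queue.append(neighbor)
    return None


graph = build_bipartite(memberships)
distance_a_f = person_distance(graph, "a", "f")
```

The saving is small for the toy groups above but grows quadratically with group size: a single workplace with 100 employees yields 4950 edges in the unipartite view and 100 in the bipartite one.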
3. **Combining multi-layer network structures to explore the relationship between social phenomena and social outcomes**:
- The researchers construct a multi-layer network covering family, household, neighborhood, colleague, and classmate layers, and examine how much these layers overlap and how that overlap affects overall network characteristics.
- The analysis shows that although the layers overlap to some degree, each captures a distinct pattern of social interaction. For example, the workplace is often where ties that later become family ties first form, and high-income groups tend to have more and broader social connections.
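
One simple way to quantify layer overlap is sketched below. The summary does not state which overlap measure the paper uses, so the Jaccard index here is an assumed stand-in, and the layer contents and function names are hypothetical.

```python
def normalize(edges):
    """Treat ties as undirected: (a, b) and (b, a) map to the same key."""
    return {tuple(sorted(edge)) for edge in edges}


def jaccard_overlap(layer_a, layer_b):
    """Shared ties divided by all ties present in either layer."""
    a, b = normalize(layer_a), normalize(layer_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def share_seen_earlier(later_layer, earlier_layer):
    """Fraction of ties in a later layer that already existed in an earlier one."""
    later, earlier = normalize(later_layer), normalize(earlier_layer)
    return len(later & earlier) / len(later) if later else 0.0


# Hypothetical toy layers: colleague ties in one year, family ties in a later year.
colleagues_earlier = {("a", "b"), ("b", "c"), ("c", "d")}
family_later = {("b", "a"), ("e", "f")}

overlap = jaccard_overlap(colleagues_earlier, family_later)
reappearing = share_seen_earlier(family_later, colleagues_earlier)
```

The second function mirrors the idea that past connections can reappear later in another layer: it asks what share of later family ties were colleague ties before.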
4. **Evaluating the association between social network structures and personal life outcomes**:
- The paper finds that the correlation between an individual's degree and their income level strengthens as the time span increases. Specifically, among people aged 25 to 60, those with more social connections tend to have higher incomes.
- This finding emphasizes that social networks are not only the result of social behavior, but also reflect the differences in individuals' socioeconomic status, providing a new perspective for understanding social inequality.
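
The degree-income association could be checked along the following lines. This is only a sketch: the choice of Spearman rank correlation is an assumption (the summary does not name the statistic used), and the records below are invented toy values, not results from the paper.

```python
def average_ranks(values):
    """1-based ranks, with tied values sharing the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; assumes both inputs have non-zero variance."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy records: (age, cumulative degree, income).
people = [
    (30, 5, 300), (45, 12, 520), (58, 20, 610),
    (70, 3, 200), (22, 2, 150), (40, 9, 480),
]

# Restrict to the 25-60 age range mentioned in the text.
working_age = [(deg, inc) for age, deg, inc in people if 25 <= age <= 60]
rho = spearman([d for d, _ in working_age], [i for _, i in working_age])
```

Recomputing `rho` with degrees aggregated over progressively longer time spans would show whether the association strengthens, which is the pattern described above.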
In summary, this paper aims to reveal the dynamic characteristics of social relationships and their association with individual life outcomes by integrating the time dimension, optimizing the network representation, and comprehensively analyzing multi-layer social network structures.