Preserving friendships in school contacts: an algorithm to construct synthetic temporal networks for epidemic modelling

Lucille Calmon,Elisabetta Colosi,Giulia Bassignana,Alain Barrat,Vittoria Colizza
DOI: https://doi.org/10.1101/2024.08.20.24312288
2024-08-20
Abstract:High-resolution temporal data on contacts between hosts provide crucial information on the mixing patterns underlying infectious disease transmission. Publicly available data sets of contact data are however typically recorded over short time windows with respect to the duration of an epidemic. To inform models of disease transmission, data are thus often repeated several times, yielding synthetic data covering long enough timescales. Looping over short term data to approximate contact patterns on longer timescales can lead to unrealistic transmission chains because of the deterministic repetition of all contacts, without any renewal of the contact partners of each individual between successive periods. Real contacts indeed include a combination of regularly repeated contacts (e.g., due to friendship relations) and of more casual ones. In this paper, we propose an algorithm to longitudinally extend contact data recorded in a school setting, taking into account this dual aspect of contacts and in particular the presence of repeated contacts due to friendships. To illustrate the interest of such an algorithm, we then simulate the spread of SARS-CoV-2 on our synthetic contacts using an agent-based model specific to the school setting. We compare the results with simulations performed on synthetic data extended with simpler algorithms to determine the impact of preserving friendships in the data extension method. Notably, the preservation of friendships does not strongly affect transmission routes between classes in the school but has a clear impact on the infection pathways between individual students. Our results moreover indicate that gathering contact data during two days in a population is sufficient to generate realistic synthetic contact sequences between individuals in that population on longer timescales. The proposed tool will allow modellers to leverage existing contact data, and contributes to the design of optimal future field data collection.
What problem does this paper attempt to address?
This paper attempts to address the issue of how to more realistically extend contact data in epidemic modeling. Specifically, the researchers propose an algorithm for longitudinally extending recorded contact data in a school environment, taking into account the dual nature of contact behavior, particularly repeated contacts (such as those resulting from friendships) and incidental contacts. Currently available contact datasets typically record data for relatively short time windows, and this data needs to be extended to sufficiently long time scales to support epidemic spread models. Simply repeating short-term data can lead to unrealistic transmission chains because it does not account for changes in individual contact partners. The researchers extended the data using two methods: a Friendship-based approach and a Class-mixing-based approach. The former retains the friendship relationships between individuals and generates synthetic contact data based on these relationships; the latter only retains the mixing patterns between classes, without considering repeated contacts between individuals. To validate the effectiveness of these methods, the researchers used an agent-based model to simulate the spread of SARS-CoV-2 in schools and compared the results with data extended using a simple algorithm. The results showed that retaining friendship relationships had little impact on the transmission paths between classes but had a significant impact on the infection paths between individual students. Additionally, the study indicated that collecting contact data for 2 days in a population is sufficient to generate realistic synthetic contact sequences over long time scales between individuals. This work helps epidemic model designers utilize existing contact data and contributes to the optimized design of future data collection.