EpiFusion: Joint inference of the effective reproduction number by integrating phylodynamic and epidemiological modelling with particle filtering
Ciara Judge,Timothy Vaughan,Timothy Russell,Sam Abbott,Louis du Plessis,Tanja Stadler,Oliver Brady,Sarah Hill
DOI: https://doi.org/10.1371/journal.pcbi.1012528
2024-11-13
PLoS Computational Biology
Abstract:Accurately estimating the effective reproduction number (R t ) of a circulating pathogen is a fundamental challenge in the study of infectious disease. The fields of epidemiology and pathogen phylodynamics both share this goal, but to date, methodologies and data employed by each remain largely distinct. Here we present EpiFusion: a joint approach that can be used to harness the complementary strengths of each field to improve estimation of outbreak dynamics for large and poorly sampled epidemics, such as arboviral or respiratory virus outbreaks, and validate it for retrospective analysis. We propose a model of R t that estimates outbreak trajectories conditional upon both phylodynamic (time-scaled trees estimated from genetic sequences) and epidemiological (case incidence) data. We simulate stochastic outbreak trajectories that are weighted according to epidemiological and phylodynamic observation models and fit using particle Markov Chain Monte Carlo. To assess performance, we test EpiFusion on simulated outbreaks in which transmission and/or surveillance rapidly changes and find that using EpiFusion to combine epidemiological and phylodynamic data maintains accuracy and increases certainty in trajectory and R t estimates, compared to when each data type is used alone. We benchmark EpiFusion's performance against existing methods to estimate R t and demonstrate advances in speed and accuracy. Importantly, our approach scales efficiently with dataset size. Finally, we apply our model to estimate R t during the 2014 Ebola outbreak in Sierra Leone. EpiFusion is designed to accommodate future extensions that will improve its utility, such as explicitly modelling population structure, accommodations for phylogenetic uncertainty, and the ability to weight the contributions of genomic or case incidence to the inference. Understanding infectious disease spread is fundamental to protecting public health, but can be challenging as disease spread is a phenomenon that cannot be directly observed. So, epidemiologists use data in conjunction with mathematical models to estimate disease dynamics. Often, combinations of different models and data can be used to answer the same questions–for example 'traditional' epidemiology commonly uses case incidence data (the number of people who have tested positive for a disease during a certain time period) whereas phylodynamic models use pathogen genomic sequence data and our knowledge of the way their genomes evolve to model disease population dynamics. Each of these approaches have strengths and limitations, and data of each type can be sparse or biased, particularly during rapidly developing outbreaks or in countries with poor pathogen surveillance. An increasing number of approaches attempt to fix this problem by incorporating diverse concepts and data types together in their models. We aim to contribute to this movement by introducing EpiFusion, a modelling framework that improves the efficiency and precision at which we can monitor important changes in pathogen transmission (specifically, in the effective reproduction number). EpiFusion uses particle filtering to simulate epidemic trajectories over time and weight their likelihood according to both case incidence data and a phylogenetic tree using separate observation models, resulting in the inference of trajectories in agreement with both sets of data. Improvements in our ability to accurately and confidently model pathogen spread help us to respond to infectious disease outbreaks and improve public health.
biochemical research methods,mathematical & computational biology