Early-Phase Local-Area Model for Pandemics Using Limited Data: A SARS-CoV-2 Application

Jiasheng Shi, Jeffrey S. Morris, David M. Rubin, Jing Huang
DOI: https://doi.org/10.48550/arXiv.2212.08282
2024-03-19
Abstract: The emergence of novel infectious agents presents challenges to statistical models of disease transmission. These challenges arise from limited, poor-quality data and an incomplete understanding of the agent. Moreover, outbreaks manifest differently across regions due to various factors, making it imperative for models to factor in regional specifics. In this work, we offer a model that effectively utilizes constrained data resources to estimate disease transmission rates at the local level, especially during the early outbreak phase when primarily infection counts and aggregated local characteristics are accessible. This model merges a pathogen transmission methodology based on daily infection numbers with regression techniques, drawing correlations between disease transmission and local-area factors, such as demographics, health policies, behavior, and even climate, to estimate and forecast daily infections. We incorporate the quasi-score method and an error term to navigate potential data concerns and mistaken assumptions. Additionally, we introduce an online estimator that facilitates real-time data updates, complemented by an iterative algorithm for parameter estimation. This approach facilitates real-time analysis of disease transmission when data quality is suboptimal and knowledge of the infectious pathogen is limited. It is particularly useful in the early stages of outbreaks, providing support for local decision-making.
Methodology, Applications
What problem does this paper attempt to address?
This paper addresses how to estimate and forecast disease transmission rates in the early stage of an outbreak of a novel infectious disease, when data are limited and of poor quality and the pathogen is not yet well understood. Specifically, the authors propose a model that estimates transmission rates at the local level from constrained data resources, targeting the early phase of an epidemic, when the main data available are infection counts and aggregated local characteristics. The model combines a pathogen transmission methodology based on daily infection numbers with regression techniques, relating disease transmission to local-area factors such as demographics, health policies, behavior, and climate in order to estimate and forecast daily infections. It also incorporates the quasi-score method and an error term to handle potential data problems and mistaken assumptions, and it introduces an online estimator, complemented by an iterative algorithm for parameter estimation, to support real-time data updates. The approach therefore permits real-time analysis even when data quality is suboptimal and knowledge of the pathogen is limited, which makes it particularly useful for supporting local decision-making early in an outbreak.
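To make the idea of pairing a transmission model with regression concrete, the following is a minimal sketch and not the authors' estimator. It assumes a renewal-type mean, mu_t = exp(x_t' beta) * Lambda_t, where Lambda_t is a weighted sum of recent daily counts, and it solves a Poisson-type quasi-score equation by Fisher scoring. The function names, the log link, the infectiousness weights, and the simulated data are all hypothetical choices made for illustration.

```python
import numpy as np


def infection_pressure(counts, weights):
    """Lambda_t = sum_{s=1..K} weights[s-1] * counts[t-s], truncated at the series start."""
    counts = np.asarray(counts, dtype=float)
    pressure = np.zeros_like(counts)
    for t in range(len(counts)):
        for s in range(1, min(len(weights), t) + 1):
            pressure[t] += weights[s - 1] * counts[t - s]
    return pressure


def fit_quasi_score(counts, covariates, weights, n_iter=50, tol=1e-8):
    """Solve sum_t x_t * (I_t - mu_t) = 0 with mu_t = exp(x_t' beta) * Lambda_t.

    This is the quasi-score equation when the variance is assumed proportional
    to the mean (an assumption of this sketch); it is solved by Fisher scoring.
    """
    y = np.asarray(counts, dtype=float)
    X = np.asarray(covariates, dtype=float)
    lam = infection_pressure(y, weights)
    keep = lam > 0  # days with no infectious history carry no information
    y, X, lam = y[keep], X[keep], lam[keep]
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = np.exp(X @ beta) * lam
        score = X.T @ (y - mu)            # quasi-score vector
        info = X.T @ (X * mu[:, None])    # expected information matrix
        step = np.linalg.solve(info, score)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


# Hypothetical noise-free demonstration data: an intercept plus one
# time-varying local covariate (a stand-in for, e.g., a policy index).
T = 40
weights = np.array([0.5, 0.3, 0.2])   # assumed infectiousness profile
true_beta = np.array([0.1, -0.3])     # assumed regression coefficients
X = np.column_stack([np.ones(T), np.sin(np.arange(T) / 3.0)])
counts = np.zeros(T)
counts[0] = 10.0
for t in range(1, T):
    lam_t = sum(weights[s - 1] * counts[t - s]
                for s in range(1, min(len(weights), t) + 1))
    counts[t] = np.exp(X[t] @ true_beta) * lam_t

beta_hat = fit_quasi_score(counts, X, weights)
print(beta_hat)
```

Because the demonstration data are generated without noise from the same mean structure, the fitted coefficients should recover `true_beta` up to numerical tolerance. With real counts, the paper's error term and online updating would be needed on top of a sketch like this.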