Estimation of mean using under-reported and overdispersed count data

Debjit Sengupta Surupa Roy Department of Statistics,St. Xavier's College,Kolkata,India
DOI: https://doi.org/10.1080/03610918.2024.2420262
2024-11-10
Communications in Statistics - Simulation and Computation
Abstract:Count data arising in various fields of applications are often under-reported. Ignoring undercount naturally leads to biased estimators and inaccurate confidence intervals. Further, overdispersion in count data may arise due to inherent heterogeneity of the data. Negative Binomial distribution is a viable candidate for modeling overdispersed count data. However, in presence of undercount the negative binomial model needs to be adjusted. In this paper, we shall develop likelihood-based methodologies for estimation of mean using validation data after accounting for underreporting and overdispersion. The impact of ignoring undercount on the coverage and length of the confidence intervals is investigated using extensive numerical studies. The study is supplemented with a real-life data analysis.
statistics & probability
What problem does this paper attempt to address?