A Regression Model for Count Data with Observation-Level Dispersion

Kimberly F. Sellers,Galit Shmueli
2009-01-01
Abstract:While Poisson regression is a popular tool for modeling count data, it is limited by its associated model assumptions. One assumption is that the re- sponse variable follows a Poisson distribution. However, over- or under-dispersion are common in practice and are not accommodated by Poisson regression. In ad- dition, the dispersion is assumed flxed across observations, whereas in practice dispersion may vary across groups or according to some other factor. Recently, Sellers and Shmueli (2008) introduced the Conway-Maxwell-Poisson (CMP) re- gression, based on the CMP distribution. CMP regression generalizes both Pois- son and logistic regression models and allows for over- or under-dispersed count data. The model structure introduced, however, assumes a flxed dispersion level across all observations. In this paper, we extend the CMP regression model to account for observation-level dispersion. We discuss model estimation, inference, diagnostics, and interpretation, and present a variable selection technique. We then compare our model to several alternatives and illustrate its advantages and usefulness using datasets with varying types and levels of dispersion.
What problem does this paper attempt to address?