Misreporting and Econometric Modelling of Zeros in Survey Data on Social Bads: an Application to Cannabis Consumption.

William Greene,Mark N. Harris,Preety Srivastava,Xueyan Zhao
DOI: https://doi.org/10.1002/hec.3553
IF: 2.395
2017-01-01
Health Economics
Abstract:When modelling "social bads," such as illegal drug consumption, researchers are often faced with a dependent variable characterised by a large number of zero observations. Building on the recent literature on hurdle and double-hurdle models, we propose a double-inflated modelling framework, where the zero observations are allowed to come from the following: nonparticipants; participant misreporters (who have larger loss functions associated with a truthful response); and infrequent consumers. Due to our empirical application, the model is derived for the case of an ordered discrete-dependent variable. However, it is similarly possible to augment other such zero-inflated models (e.g., zero-inflated count models, and double-hurdle models for continuous variables). The model is then applied to a consumer choice problem of cannabis consumption. We estimate that 17% of the reported zeros in the cannabis survey are from individuals who misreport their participation, 11% from infrequent users, and only 72% from true nonparticipants.
What problem does this paper attempt to address?