A Bayesian approach to disease clustering using restricted Chinese restaurant processes

Claudia Wehrhahn,Samuel Leonard,Abel Rodriguez,Tatiana Xifara
DOI: https://doi.org/10.1214/20-ejs1696
2020-01-01
Electronic Journal of Statistics
Abstract:Identifying disease clusters (areas with an unusually high incidence of a particular disease) is a common problem in epidemiology and public health. We describe a Bayesian nonparametric mixture model for disease clustering that constrains clusters to be made of adjacent areal units. This is achieved by modifying the exchangeable partition probability function associated with the Ewen’s sampling distribution. We call the resulting prior the Restricted Chinese Restaurant Process, as the associated full conditional distributions resemble those associated with the standard Chinese Restaurant Process. The model is illustrated using synthetic data sets and in an application to oral cancer mortality in Germany.
statistics & probability
What problem does this paper attempt to address?