Crime in Philadelphia: Bayesian Clustering with Particle Optimization

Cecilia Balocchi,Sameer K. Deshpande,Edward I. George,Shane T. Jensen
DOI: https://doi.org/10.48550/arXiv.1912.00111
2022-06-21
Abstract:Accurate estimation of the change in crime over time is a critical first step towards better understanding of public safety in large urban environments. Bayesian hierarchical modeling is a natural way to study spatial variation in urban crime dynamics at the neighborhood level, since it facilitates principled ``sharing of information'' between spatially adjacent neighborhoods. Typically, however, cities contain many physical and social boundaries that may manifest as spatial discontinuities in crime patterns. In this situation, standard prior choices often yield overly-smooth parameter estimates, which can ultimately produce mis-calibrated forecasts. To prevent potential over-smoothing, we introduce a prior that partitions the set of neighborhoods into several clusters and encourages spatial smoothness within each cluster. In terms of model implementation, conventional stochastic search techniques are computationally prohibitive, as they must traverse a combinatorially vast space of partitions. We introduce an ensemble optimization procedure that simultaneously identifies several high probability partitions by solving one optimization problem using a new local search strategy. We then use the identified partitions to estimate crime trends in Philadelphia between 2006 and 2017. On simulated and real data, our proposed method demonstrates good estimation and partition selection performance.
Applications,Methodology
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to solve the problem of accurately estimating the change in crime rates in urban environments, especially the urban crime dynamics in Philadelphia. Specifically, the authors hope to improve the understanding and prediction of crime trends through the following aspects: 1. **Enhance the understanding of the spatial heterogeneity of urban crime patterns**: Traditional spatial smoothing models (such as the Conditional Autoregressive model, CAR model) may lead to over - smoothing of parameter estimates when dealing with complex urban environments with obvious geographical and social boundaries, resulting in inaccurate predictions. This paper proposes a new modeling method to prevent this over - smoothing phenomenon. 2. **Identify community clusters with different crime trends**: By dividing communities into multiple clusters, the crime trends within each cluster are similar, but there may be significant differences between different clusters. This method can help identify communities whose crime rates are different from the overall downward trend in the city, as well as areas where the baseline crime level is significantly higher or lower than that of the surrounding communities. 3. **Provide more accurate crime density estimates**: In order to compare the crime situations among different communities more accurately, the authors use the crime density based on area rather than population (the number of violent crimes per square mile). In addition, they introduce the inverse hyperbolic sine transformation to deal with the problem of skewed distribution in the data. 4. **Develop an efficient posterior optimization algorithm**: Due to the vastness of the combinatorial search space, the traditional Markov Chain Monte Carlo (MCMC) method is computationally infeasible. For this reason, the authors propose a new local search strategy that can simultaneously identify multiple high - probability partitions and achieve this by solving a single optimization problem. ### Main contributions - **Propose a "CAR - within - clusters" model**: This model allows different spatial clustering among parameters of the same type (such as baseline level and time trend), thereby better capturing the spatial heterogeneity of crime patterns. - **Introduce a new variational approximation and local optimization strategy**: This method can effectively explore the posterior distribution and identify multiple high - probability partitions without sacrificing computational efficiency. - **Apply to real - data**: By analyzing the crime data in Philadelphia from 2006 to 2017, the effectiveness and superiority of the proposed method are verified. ### Conclusion Through the above methods, the authors not only improve the understanding of the crime patterns in Philadelphia but also provide valuable reference bases for urban planning, policy - making, and law - enforcement resource allocation.