Hierarchical Bayesian modeling of multi-region brain cell count data

Sydney Dimmock, Benjamin M.S. Exley, Gerald Moore, Lucy Menage, Alessio Delogu, Simon R Schultz, E Clea Warburton, Conor J Houghton, Cian O'Donnell
DOI: https://doi.org/10.1101/2024.07.20.603979
2024-07-21
Abstract: We can now collect cell-count data across whole animal brains quantifying recent neuronal activity, gene expression, or anatomical connectivity. This is a powerful approach since it is a multi-region measurement, but because the imaging is done post-mortem, each animal only provides one set of counts. Experiments are expensive and since cells are counted by imaging and aligning a large number of brain sections, they are time-intensive. The resulting datasets tend to be under-sampled with fewer animals than brain regions. As a consequence, these data are a challenge for traditional statistical approaches. We demonstrate that hierarchical Bayesian methods are well suited to these data by presenting a 'standard' partially-pooled Bayesian model for multi-region cell-count data and applying it to two example datasets. For both datasets the Bayesian model outperformed standard parallel t-tests. Overall, the Bayesian approach's ability to capture nested data and its rigorous handling of uncertainty in under-sampled data can substantially improve inference for cell-count data.
Neuroscience
What problem does this paper attempt to address?
This paper addresses the difficulty of statistically analyzing multi-region brain cell-count data. The experiments are expensive and time-consuming: brain sections from each animal must be imaged and aligned to a standardized brain atlas so that specific cells can be counted in many brain regions. As a result, each experimental group usually contains few animals while the number of brain regions analyzed is large, so the datasets are typically under-sampled. This "wide but shallow" data structure is a challenge for traditional statistical methods, especially in the presence of high variability and outliers. To address this, the authors propose a hierarchical Bayesian model for multi-region cell-count data. The model captures the nested structure of the data and uses partial pooling to balance individual observations against group-level estimates, which improves the accuracy of the parameter estimates. The Bayesian approach also quantifies the uncertainty of those estimates, which is important for understanding the variability in the data and for drawing more reliable inferences. The paper shows how to construct a standard partially-pooled Bayesian model and applies it to two example datasets, one from a study of neuronal activation and one from a study of developmental lineages. In both datasets the Bayesian model outperformed traditional parallel t-tests, especially where sample sizes were small and variability was high.
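The idea of partial pooling can be illustrated with a minimal sketch. This is not the authors' model: it assumes a simple normal-normal setup in which each animal's value for one region (for example a log-transformed count) is noisy around its own true value, and those true values vary around a shared group mean. The function name `partial_pool`, the variances `sigma2` and `tau2`, and the example numbers are all hypothetical, and the variances are treated as known rather than inferred, with the group mean replaced by the sample mean.

```python
from statistics import mean


def partial_pool(values, sigma2, tau2):
    """Shrink each animal's value toward the group mean (illustrative only).

    values : one measurement per animal for a single brain region
    sigma2 : assumed observation-noise variance (hypothetical, fixed)
    tau2   : assumed between-animal variance (hypothetical, fixed)
    """
    group_mean = mean(values)
    # Weight on the individual observation:
    #   tau2 much larger than sigma2 -> weight near 1 (no pooling)
    #   tau2 much smaller than sigma2 -> weight near 0 (complete pooling)
    weight = tau2 / (tau2 + sigma2)
    return [group_mean + weight * (y - group_mean) for y in values]


# Hypothetical values for three animals; the third looks like an outlier.
example = [4.0, 5.0, 9.0]
print(partial_pool(example, sigma2=1.0, tau2=1.0))  # [5.0, 5.5, 7.5]
```

With equal variances every value moves halfway toward the group mean, so the outlying animal is pulled in without being discarded. A full hierarchical Bayesian model would instead place priors on the group mean and variances and infer them jointly with the per-animal parameters, which is what provides the uncertainty estimates described above.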