Non-Gaussian Normative Modelling With Hierarchical Bayesian Regression

Augustijn A.A. de Boer,Johanna M. M. Bayer,Seyed Mostafa Kia,Saige Rutherford,Mariam Zabihi,Charlotte Fraza,Pieter Barkema,Lars T. Westlye,Ole A. Andreassen,Max Hinne,Christian F. Beckmann,Andre Marquand
DOI: https://doi.org/10.1101/2022.10.05.510988
2024-03-15
Abstract:Normative modelling is an emerging technique for parsing heterogeneity in clinical cohorts. This can be implemented in practice using hierarchical Bayesian regression, which provides an elegant probabilistic solution to handle site variation in a federated learning framework. However, applications of this method to date have employed a Gaussian assumption, which may be restrictive in some applications. We have extended the hierarchical Bayesian regression framework to flexibly model non-Gaussian data with heteroskdastic skewness and kurtosis. To this end, we employ a flexible distribution from the sinh-arcsinh (SHASH) family, and introduce a novel reparameterisation and a Markov chain Monte Carlo sampling approach to perform inference in this model. Using a large neuroimaging dataset collected at 82 different sites, we show that the results achieved with this extension are equivalent or better than a warped Bayesian linear regression baseline model on most datasets, whilst providing better control over the parameters governing the shape of distributions that approach is able to model. We also demonstrate that the attained flexibility is useful for accurately modelling highly nonlinear relationships between aging and imaging derived phenotypes, which shows that the extension is important for pushing the field of normative modelling forward. All methods described here are available in the open-source .
Neuroscience
What problem does this paper attempt to address?