Abstract:Background: Evidence-based treatment decisions in medicine are made founded on population-level evidence obtained during randomized clinical trials. In an era of personalized medicine, these decisions should be based on the predicted benefit of a treatment on a patient-level. Survival prediction models play a central role as they incorporate the time-to-event and censoring. In medical applications uncertainty is critical especially when treatments differ in their side effect profiles or costs. Additionally, models must be adapted to local populations without diminishing performance and often without the original training data available due to privacy concern. Both points are supported by Bayesian models-yet they are rarely used. The aim of this work is to evaluate Bayesian parametric survival models on public datasets including cardiology, infectious diseases, and oncology. Materials and methods: Bayesian parametric survival models based on the Exponential and Weibull distribution were implemented as a Python package. A linear combination and a neural network were used for predicting the parameters of the distributions. A superiority design was used to assess whether Bayesian models are better than commonly used models such as Cox Proportional Hazards, Random Survival Forest, and Neural Network-based Cox Proportional Hazards. In a secondary analysis, overfitting was compared between these models. An equivalence design was used to assess whether the prediction performance of Bayesian models after model updating using Bayes rule is equivalent to retraining on the full dataset. Results: In this study, we found that Bayesian parametric survival models perform as good as state-of-the art models while requiring less hyperparameters to be tuned and providing a measure of the uncertainty of the predictions. In addition, these models were less prone to overfitting. Furthermore, we show that updating these models using Bayes rule yields equivalent performance compared to models trained on combined original and new datasets. Conclusions: Bayesian parametric survival models are non-inferior to conventional survival models while requiring less hyperparameter tuning, being less prone to overfitting, and allowing model updating using Bayes rule. Further, the Bayesian models provide a measure of the uncertainty on the statistical inference, and, in particular, on the prediction.

Bayesian shrinkage methods for partially observed data with many predictors

Bayesian estimation for longitudinal data in a joint model with HPCs

Bayesian parametric models for survival prediction in medical applications

Bayesian Modeling Longitudinal Dyadic Data with Nonignorable Dropout, with Application to a Breast Cancer Study

Bayesian Analysis of Longitudinal Dyadic Data with Informative Missing Data Using a Dyadic Shared-Parameter Model

A Bayesian hierarchical model for prediction of latent health states from multiple data sources with application to active surveillance of prostate cancer

Two-Step Mixed-Type Multivariate Bayesian Sparse Variable Selection with Shrinkage Priors

Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data

Bayesian High-dimensional Linear Regression with Sparse Projection-posterior

Bayesian Variable Shrinkage and Selection in Compositional Data Regression: Application to Oral Microbiome

Adverse Subpopulation Regression for Multivariate Outcomes with High-Dimensional Predictors

Bayesian Nonparametric Variable Selection as an Exploratory Tool for Finding Genes that Matter

A Bayesian Approach to Restricted Latent Class Models for Scientifically-Structured Clustering of Multivariate Binary Outcomes

A Bayesian approach for fitting semi-Markov mixture models of cancer latency to individual-level data

High-dimensional Grouped-regression using Bayesian Sparse Projection-posterior

Tail-adaptive Bayesian shrinkage

Bayesian semiparametric joint modeling of a count outcome and inconveniently timed longitudinal predictors

Bayesian shrinkage prediction for the regression problem

Penalized Bayesian forward continuation ratio model with application to high-dimensional data with discrete survival outcomes

On the variability of regression shrinkage methods for clinical prediction models: simulation study on predictive performance