An Introduction to Deep Survival Analysis Models for Predicting Time-to-Event Outcomes

George H. Chen

2024-10-02

Abstract: Many applications involve reasoning about time durations before a critical event happens, also called time-to-event outcomes. When will a customer cancel a subscription, a coma patient wake up, or a convicted criminal reoffend? Time-to-event outcomes have been studied extensively within the field of survival analysis primarily by the statistical, medical, and reliability engineering communities, with textbooks already available in the 1970s and '80s. This monograph aims to provide a reasonably self-contained modern introduction to survival analysis. We focus on predicting time-to-event outcomes at the individual data point level with the help of neural networks. Our goal is to provide the reader with a working understanding of precisely what the basic time-to-event prediction problem is, how it differs from standard regression and classification, and how key "design patterns" have been used time after time to derive new time-to-event prediction models, from classical methods like the Cox proportional hazards model to modern deep learning approaches such as deep kernel Kaplan-Meier estimators and neural ordinary differential equation models. We further delve into two extensions of the basic time-to-event prediction setup: predicting which of several critical events will happen first along with the time until this earliest event happens (the competing risks setting), and predicting time-to-event outcomes given a time series that grows in length over time (the dynamic setting). We conclude with a discussion of a variety of topics such as fairness, causal reasoning, interpretability, and statistical guarantees. Our monograph comes with an accompanying code repository that implements every model and evaluation metric that we cover in detail.

Machine Learning

What problem does this paper attempt to address?

The paper addresses key challenges in time-to-event prediction, in particular how to predict time-to-event outcomes accurately when some of the data are censored. It focuses on the following aspects:

1. **Handling Censored Data**: In many real-world applications, the collected data include points whose time-to-event outcome has not been observed (e.g., a comatose patient who has not yet awakened). These points cannot simply be excluded, because their characteristics may help explain why the event has not yet occurred or may never occur.
2. **Modeling Time-to-Event Prediction**: The paper explores how to use neural networks and other machine learning methods to predict time-to-event outcomes for individual data points. It covers classical survival analysis models (such as the Cox proportional hazards model) and modern deep learning methods (such as deep kernel Kaplan-Meier estimators and neural ordinary differential equation models).
3. **Competing Risks in Multi-Event Settings**: Beyond the basic time-to-event prediction problem, the paper discusses the competing risks setting, i.e., predicting which of several critical events will occur first and the time until that earliest event.
4. **Dynamic Time Series Inputs**: The paper also explores time-to-event prediction when the input is a time series that grows in length over time, which is particularly important in fields such as healthcare and finance.

By addressing these issues, the paper aims to give readers a comprehensive introduction to modern survival analysis, particularly how to use neural networks to build and understand time-to-event prediction models.
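To make the censoring point concrete, here is a minimal sketch (not taken from the monograph's code repository) of the classical Kaplan-Meier estimator on a hypothetical toy dataset. The function name `kaplan_meier`, the variable names, and the data are all illustrative assumptions. The sketch compares the estimate that uses censored points with a naive estimate that discards them.

```python
def kaplan_meier(times, observed):
    """Kaplan-Meier survival estimate.

    times[i]    : observed duration for point i (event time or censoring time)
    observed[i] : True if the event was seen, False if the point is censored
    Returns a list of (t, S(t)) pairs at each distinct observed event time.
    """
    event_times = sorted({t for t, o in zip(times, observed) if o})
    survival = 1.0
    curve = []
    for t in event_times:
        # Points still under observation just before time t (censored ones count).
        at_risk = sum(1 for u in times if u >= t)
        # Events that happen exactly at time t.
        events = sum(1 for u, o in zip(times, observed) if u == t and o)
        survival *= 1.0 - events / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical toy data: three observed events, three censored points.
times = [2, 3, 3, 5, 8, 8]
observed = [True, False, True, True, False, False]

# Estimate that keeps censored points in the at-risk sets.
km_curve = kaplan_meier(times, observed)

# Naive estimate that throws censored points away entirely.
naive_times = [t for t, o in zip(times, observed) if o]
naive_curve = kaplan_meier(naive_times, [True] * len(naive_times))

for (t, s_km), (_, s_naive) in zip(km_curve, naive_curve):
    print(f"t={t}: with censored points {s_km:.3f}, censored points dropped {s_naive:.3f}")
```

On this toy data the naive curve falls to zero at the last observed event time, while the estimate that keeps censored points stays well above zero. This illustrates why excluding censored points tends to make survival look shorter than it is.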
