Abstract:Background/aims: After completion of a randomised controlled trial, an extended follow-up period may be initiated to learn about longer term impacts of the intervention. Since extended follow-up studies often involve additional eligibility restrictions and consent processes for participation, and a longer duration of follow-up entails a greater risk of participant attrition, missing data can be a considerable threat in this setting. As a potential source of bias, it is critical that missing data are appropriately handled in the statistical analysis, yet little is known about the treatment of missing data in extended follow-up studies. The aims of this review were to summarise the extent of missing data in extended follow-up studies and the use of statistical approaches to address this potentially serious problem. Methods: We performed a systematic literature search in PubMed to identify extended follow-up studies published from January to June 2015. Studies were eligible for inclusion if the original randomised controlled trial results were also published and if the main objective of extended follow-up was to compare the original randomised groups. We recorded information on the extent of missing data and the approach used to treat missing data in the statistical analysis of the primary outcome of the extended follow-up study. Results: Of the 81 studies included in the review, 36 (44%) reported additional eligibility restrictions and 24 (30%) consent processes for entry into extended follow-up. Data were collected at a median of 7 years after randomisation. Excluding 28 studies with a time to event primary outcome, 51/53 studies (96%) reported missing data on the primary outcome. The median percentage of randomised participants with complete data on the primary outcome was just 66% in these studies. The most common statistical approach to address missing data was complete case analysis (51% of studies), while likelihood-based analyses were also well represented (25%). Sensitivity analyses around the missing data mechanism were rarely performed (25% of studies), and when they were, they often involved unrealistic assumptions about the mechanism. Conclusion: Despite missing data being a serious problem in extended follow-up studies, statistical approaches to addressing missing data were often inadequate. We recommend researchers clearly specify all sources of missing data in follow-up studies and use statistical methods that are valid under a plausible assumption about the missing data mechanism. Sensitivity analyses should also be undertaken to assess the robustness of findings to assumptions about the missing data mechanism.

When and how to split the follow-up time in the analysis of epidemiological or clinical studies with follow-ups

Multiple imputation for longitudinal data: A tutorial

Longitudinal Data with Follow-up Truncated by Death: Match the Analysis Method to Research Aims

A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications

Marginal analysis of longitudinal count data in long sequences: Methods and applications to a driving study

Time-varying Covariates and Coefficients in Cox Regression Models

Comparison of nested case-control and survival analysis methodologies for analysis of time-dependent exposure

Reshaping and Aggregating Data: an Introduction to Reshape Package.

Survival analysis: Part I — analysis of time-to-event

Treatment of missing data in follow-up studies of randomised controlled trials: A systematic review of the literature

Sequence Analysis as an approach to characterize variables that unfold over time: implementation and practical considerations for epidemiologists

Studying Continuous, Time-varying, and/or Complex Exposures Using Longitudinal Modified Treatment Policies

The optimal pre-post allocation for randomized clinical trials

The Impact of Time Series Length and Discretization on Longitudinal Causal Estimation Methods

Long-term effect estimation when combining clinical trial and observational follow-up datasets

Repeated Measures Designs and Analysis of Longitudinal Data: If at First You Do Not Succeed—Try, Try Again

Advanced considerations in survival analysis

Efficient designs and analysis of two-phase studies with longitudinal binary data

Dealing with Treatment-Confounder Feedback and Sparse Follow-up in Longitudinal Studies: Application of a Marginal Structural Model in a Multiple Sclerosis Cohort

Accounting for informative observation process in transition models of binary longitudinal outcome: Application to medical record data