Left-censored recurrent event analysis in epidemiological studies: a proposal when the number of previous episodes is unknown

Gilma Hernández-Herrera, David Moriña, Albert Navarro
DOI: https://doi.org/10.48550/arXiv.2102.11279
2021-02-22
Abstract: Left censoring can occur with relative frequency when analysing recurrent events in epidemiological studies, especially observational ones. Concretely, the inclusion of individuals that were already at risk before the effective initiation in a cohort study may leave prior episodes that have already been experienced unrecorded, and this will easily lead to biased and inefficient estimates. The objective of this paper is to propose a statistical method that performs successfully in these circumstances. Our proposal is based on the use of models with specific baseline hazard, imputing the number of prior episodes when unknown, with a stratified model depending on whether the individual had or had not previously been at risk, and the use of a frailty term. The performance is examined in different scenarios through a comprehensive simulation study. The proposed method achieves notable performance even when the percentage of subjects at risk before the beginning of the follow-up is very elevated, with biases that are often under 10% and coverages of around 95%, sometimes somewhat conservative. If the baseline hazard is constant, it seems that the "Gap Time" approach is better; if it is not constant, the "Counting Process" seems to be a better choice. Because of the lack of knowledge of the prior episodes that have been experienced by a part (or all) of subjects, the use of common baseline methods is not advised. Our proposal seems to perform acceptably in the majority of the scenarios proposed, becoming an interesting alternative in this context.
Methodology, Applications
What problem does this paper attempt to address?
This paper addresses left censoring in the analysis of recurrent events in epidemiological studies, especially cohort studies. The problem arises when some or all subjects were already at risk before follow-up began, so the number of episodes they had previously experienced is unknown. Ignoring this history leads to biased and inefficient estimates. The paper proposes a statistical method designed to perform well without knowing how many prior episodes each individual experienced. The method combines three elements: models with a specific baseline hazard, in which the number of prior episodes is imputed when unknown; stratification according to whether or not the individual had previously been at risk; and a frailty term to account for heterogeneity among individuals. A comprehensive simulation study examines the performance of the proposed method in different scenarios. In it, biases are often under 10% and coverages are around 95%, with the "Gap Time" formulation appearing preferable when the baseline hazard is constant and the "Counting Process" formulation when it is not.
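The "Counting Process" and "Gap Time" formulations named in the abstract differ in how each at-risk interval is placed on the time axis. The following is a minimal sketch of that difference for one subject, not the authors' implementation. The function names, the tuple layout, and the `prior_episodes` parameter are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

# (start, stop, event indicator, episode number); an illustrative layout only
Row = Tuple[float, float, int, int]


def counting_process_rows(event_times: List[float],
                          end_of_followup: float,
                          prior_episodes: int = 0) -> List[Row]:
    """Intervals on the total-time scale: each interval starts where the last ended.

    prior_episodes is the (assumed known) number of episodes before follow-up;
    it shifts the episode number used to pick an episode-specific baseline hazard.
    """
    rows: List[Row] = []
    start = 0.0
    episode = prior_episodes + 1
    for t in sorted(event_times):
        rows.append((start, t, 1, episode))  # interval ending in an event
        start = t
        episode += 1
    if start < end_of_followup:
        # Final interval, censored at the end of follow-up
        rows.append((start, end_of_followup, 0, episode))
    return rows


def gap_time_rows(event_times: List[float],
                  end_of_followup: float,
                  prior_episodes: int = 0) -> List[Row]:
    """Same intervals, but the clock is reset to zero after every event."""
    return [
        (0.0, stop - start, status, episode)
        for start, stop, status, episode
        in counting_process_rows(event_times, end_of_followup, prior_episodes)
    ]


if __name__ == "__main__":
    # One subject with events at t=2 and t=5, followed until t=8
    for row in counting_process_rows([2.0, 5.0], 8.0):
        print("counting process:", row)
    for row in gap_time_rows([2.0, 5.0], 8.0):
        print("gap time:        ", row)
```

In this sketch the episode number depends on `prior_episodes`. For a subject who was at risk before follow-up began, that value is not observed, so the episode number would be wrong if it were simply taken as zero. This is the situation in which the abstract proposes imputing the number of prior episodes.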